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FOREWORD 


Inventory control is a major field in OR. Interest in the problems of optimal stock 
management at a scientific level goes back to the start of the 20th century. The most 
important impulse, however, came after the 2nd World War when scientists of the 
caliber of Jacob, Marschak, Kenneth Arrow, Samuel Karlin among others looked into 
the problem of optimal stocking under stochastic demand. It was characteristic of this 
discipline, that methods of solving problems of this type were developed first before the 
necessary commercial electronic data processing for their ready application were 
available. 

The importance of inventory control in business increased dramatically with the 
increasing interest rates of the 70s. It was the rule of the hour to release surplus 
operating capital tied up in excessive inventories and to use the resulting liquidity to 
finance new investments. There arose a need for intelligent solutions to the problem. 
Unfortunately, OR experts and software applications developers were following 
separate paths. Hence, the opportunity to find the best solutions, using the combined 
potentials of theory, problem analysis and experience, was not exploited. 

Today, we stand before the development and realization of sophisticated CIM concepts 
and it is imperative to reset the course. This book is a contribution to this effort. It 
shows how inventory control, with the help of OR, can be rationally structured. It is 
understandable that a complete treatment of this vast material is not possible and it is 
not intended. The book limits itself to standard models and the important 
ramifications. Special emphasis is given to fundamentals. On one hand, the reader is 
shown how the models are appropriately formulated and extended for special problems. 
On the other hand, the needed mathematical derivations are completely and 
comprehensively described so that, using the methods learned, the reader will be able 
to work out his own models not treated in this book. 

Numerical methods for solving problems with random demand in complicated cases 
were given emphasis in this book. The important algorithms are thoroughly treated so 
that special situations can be handled. 
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The book is intended for OR practitioners and economists, as well as for information 
managers who are engaged in the development of modern computerized inventory 
systems, working for corporations, software houses or computer manufacturers. 

The authors were actively supported in the writing of this book. Ingrid Riedlbeck and 
Susanne Spielvogel, both mathematics degree holders, patiently checked the 
mathematical derivations and reviewed them in detail. Mr. Robert Hackl read the 
proofs for the whole text and made the diagrams. Mrs. Karola Treiber and Mrs. 
Bernardy Schwarzwalder did the typing. Maria Luisa and Roberto Asuncion 
translated the German text, prepared the charts and typed the English manuscript. 
Finally, we would like to acknowledge the contribution of Prof. James Pope for his 
valuable comments and corrections to the English edition. To all we express our 
sincere gratitude. 

St. Gallen/Munich, April 1992 

Dieter Bartmann Martin J. Beckmann 



OVERVIEW 


The book is divided into six chapters. Chapter 1 deals with inventory where demand is 
deterministic. Chapters 2-5 consider cases of stochastic demand. Chapter 6 is 
devoted to computational procedures. 

CHAPTER 1 follows a historical line. The lot size model of Wilson (or Harris or 
Andler) is presented after a short introduction (Section 1). Although the assumptions 
in these models are of the simplest type, the derived formula for the optimal lot size 
proves to be rather robust in many practical situations, e.g., in the transition from a 
constant rate of demand to the Poisson demand (as will be shown in Chapter 2). 

Costs and sensitivity are discussed in Section 3. It is shown that these are decreasing 
with scale when the ordering rule is optimal, i.e., with increasing turnover, the cost per 
unit of inventory becomes smaller. Under decentralization, this effect of increasing 
returns to scale is partially lost. The appropriate formulas are derived. Using 
sensitivity analysis, the effects of various parameters on the total expected cost are 
discussed; first, when the rate of demand or the various costs are wrongly estimated; 
secondly, when an optimal order quantity is not realizable due to special packaging 
units or container size conditions; or, thirdly, when the desired period length between 
orders is predetermined because of internal business organizational reasons or because 
of prescribed delivery dates as in the case of the pharmaceutical industry. 

The next two sections, Section 4 and Section 5, deal with multi—item inventories. 
Section 4 discusses the theoretical foundation of a classification for an " ABC-analysis" 
with respect to turnover volume and prices. Section 5 considers the question of stock 
maintenance. How high must the demand rate be in order to make it worthwhile to 
keep an article in stock at all? 

To apply scientific inventory control it is important to have, as much as possible, an 
estimate of demand. Unfortunately, sales figures are often aggregated (monthly, 
quarterly, yearly) and disaggregated values are sometimes not readily available. It is, 
therefore, shown in Section 6 how the demand rate can be estimated from order data. 



VIII 


How does the optimal inventory policy change when the objective of the firm is profit 
maximation rather than minimization of inventory costs? In this connection, the 
question also comes up of how the inventories of a firm should be evaluated. These 
questions are examined in Sections 7 and 8. 

The standard model needs modification in cases of quantity discounts. Two cases are 
discussed in Section 9: (a) the quantity discount is given only to the extent that the 
order exceeds a cutoff point and (b) it is granted for the whole order quantity when the 
cutoff point is exceeded. 

In Section 10, we examine when a collective order is more advantageous than single 
orders. Up to this point, sales inventory or raw materials inventory have been 
discussed. In Section 11 production inventory or finished goods inventory for internal 
production is considered . How big is the optimal lot size in production with a 
continuous demand at a constant rate? 

The consequences of inventory shortages are discussed in Section 12. For businesses 
with a monopolistic character, demand is not lost even when there are delivery 
bottlenecks (the so-called backorder case). Even then, however, an inventory deficit 
will cost something since profits can only be realized later. It turns out that shortages 
may be perfectly profitable. The optimal ordering cycles and order quantities can then 
be calculated. 

Discrete lot sizes are handled in Section 13. This is especially important for small lots 
and for goods with low demand. 

In Section 14, warehouse shelf space is considered. In the first case, a fixed shelf space 
is reserved for each good. In the second, the order period between two goods is 
staggered in such a way that the maximum shelf space is kept as small as possible. In 
addition to space limitations, budget constraints can be in effect. The question of 
limited space and/or budget limitations on the optimal order quantity is considered in 
Section 15. 



IX 


In Section 16, we consider a " rolling demand horizon” where only the demand in the 
next period is exactly known but unknown thereafter. In Section 17, a fixed delivery 
schedule is considered. Once more the question arises as to when it is worthwhile to 
keep goods in stock and when it is more advantageoous to sell on order. 

In Section 18, the random deviations from a fixed delivery schedule are considered and 
an optimal system of" Just-in—Time" is determined. 

CHAPTER 2 extends the simple Wilson inventory model to cases of random demand. 
A first example is a Poisson process. (Further generalizations, e.g., to random delivery 
schedules or distributed demand, are treated in Chapter 4). Sections 19 and 20 give an 
introduction to the Poisson process together with some generalizations and present the 
decision criterion to be used for decision making under risk. Section 21 deals with 
continuous interest payment and infinite payment flows. In Sections 22 and 23, the 
inventory model with Poisson demand in the discounted and non-discounted case is 
formulated as a Dynamic Program using Bellman’s "Principle of Optimality”. The 
model is further generalized in Section 24 to handle the case of randomly dependent 
demand, a Semi—Markov process. Section 25 demonstrates the use of policy iteration 
of dynamic programming to show that even for stochastic demand, the optimal order 
quantity is identical to the Wilson lot size of the deterministic model. 

In CHAPTER 3, single period models are discussed. This type of "inventory" problem 
occurs, for example, in fashion articles or ticket sales or in preparing for an expedition. 
The basic model known as the newsboy problem is presented in Section 26. In this 
connection, we determine when it is profitable to enter into a single period business at 
all. 

The dependence of the optimal lot size on the parameters of the demand distribution 
and on the inventory and shortage costs are discussed in Section 27. With the use of 
entropy, it is shown that the single period cost increases as the difference between 
inventory and shortage costs becomes smaller. In Sections 28 and 29, we look into the 
optimal period length. The "overbooking of reservations" is similar to the newsboy 
problem. Since it seldom happens that all reservations are actually claimed, it pays off 
to sell a certain part of a reserved quota a second time. 



X 


CHAPTER 4 treats the stochastic demand model under continuous monitoring. Two 
approaches are dynamic programming and the Method of State Probability. The latter 
is explained in Section 31 and is applied to a model with geometrically distributed 
demand and to a model with Poisson demand and exponential delivery time. Here we 
also consider the possibility that the inventory cost depends on the maximum stock 
level. This occurs, for instance, when storage space must be rented. 

The emphasis of Sections 33-35 is on delivery times. Supplier reliability is an 
important factor under competition. In many cases one can therefore look at the 
delivery time as certain. In Section 33, a model with fixed delivery time is discussed. 
In a monopolistic situation or where the goods are allocated, the uncertainty occurs not 
so much in demand but in the delivery times. This is especially observable in 
developing countries. The situation of internal production or just—in—time deliveries is 
specially treated here because delays in delivery are critical. 

CHAPTER 5 examines stochastic inventory models with periodic monitoring. Even 
though continuous inventory monitoring is now a common practice, many businesses 
still use periodic inspection and order decisions. 

Periodic models occur also when arrangements have been made with suppliers for 
deliveries at specific times. The basic Arrow-Harris—Marschak (AHM) model with a 
finite (Section 36) and infinite (Section 37) planning horizon is formulated. This model 
is normalized to a standard format in Section 38. Optimal order policies for different 
expected values and standard deviations of the demand distribution can then be 
derived directly from the optimal solution of the standard model. 

A special model (Section 39) is the AHM-Model with exponentially distributed 
demand, the period analog to the continuous model with Poisson demand. It is solved 
explicitly. 

The optimality of the (s, S)-policy is examined in Sections 40-44. A method of 
calculating s and S is given for a special case. In Section 45 the model with a single 
period delivery lag is formulated. It is shown how this model fits into the framework 
of the AHM-Model. An interesting result is that the inventory fluctuation of a model 
with positive delivery time is greater than for models with instantaneous delivery. 





XI 


In general, a lengthening of the delivery time makes inventory control more expensive. 

The normal assumption of a stationary demand process is not always realized in 
practice; demand levels are subject to fluctuations over time. Information about the 
immediate past may be available from which a short term forecast can be made. This 
information should be considered in the model. This is done in the following two 
sections. Autocorrelated demand is assumed in Section 46. In Section 47, endogeneous 
and exogeneous forecast mechanisms are introduced into the model. Exponential 
smoothing is given as an example. 

Special considerations are given to goods which have a normally distributed demand, 
have small rates of market growth and whose turnover is forecast with the help of 
exogeneous variables, such that successive forecasts are not autocorrelated. If, for 
example, the exogeneous variable is given by the change in the gross national product, 
then this approach is suitable for goods which are subject to the acceleration principle, 
e.g. investment goods or spare parts. 

So far we have attempted to derive formulas for the optimal lot size and ordering rules. 
In cases where this is not possible, one can lean on the computational method of 
dynamic programming. This is the content of Chapter 6. In Section 48, the method of 
value iteration is developed. It is the most general method of dynamic programming 
and can be applied to inventory models which, due to their complicated cost structure, 
deviate too much from the studied basic models. The advantages and weaknesses of 
this method are shown and a possibility of shortening the computation time is given. 

Policy iteration is presented in Section 49. It is an alternative to value iteration in the 
inventory problem with infinite planning horizon. For this type of problem, value and 
policy iteration can be combined into a third method, the so-called policy—value 
iteration. This is, however, not considered here because the method of bisection in 
connection with dynamic programming introduced in Section 50 promises to be better. 



XII 


In the last section, the AHM-Model is specially considered for the backorder case j 
without discounting. For this model, a standard form was derived (Section 38) which j 
is, however, subject to a limitation with respect to the assumed distribution of 
demand. It is, therefore, important that fast methods of computation are available for 
models with a general demand distribution. One such method was developed by 
Federgruen and Zipkin. It is discussed in Section 51. 
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CHAPTER T 

DETERMINISTIC INVENTORY MODELS 


§1 INTRODUCTION 

J.M. KEYNES differentiated three motives for holding money which can be applied to 
inventory problems. 

1. The Transaction Motive 

Since outflows are not synchronized with inflows, stocks are needed to 
bridge these discrepancies. Usually, incoming goods arrive in greater 
quantities and in longer time intervals than outgoing ones. 

2. The Precautionary Motive 

If an order is placed, one must maintain reserve stocks in order to satisfy 
the demand while awaiting delivery. 

3. The Speculative Motive 

If prices are expected to rise, it pays to keep stocks on hand. 

In operations research (OR), inventory is typically geared towards the first two 
motives. The third is occasionally treated in linear optimization as the so-called 
warehousing problem. 

Inventory theory belongs to the first and, therefore, classical application areas of OR. 
It was strongly supported in the 1950s, primarily by the US Navy. 

Scientists of the caliber of OSKAR MORGENSTERN, JAKOB MARSCHAK, 
KENNETH ARROW, HERBERT SCARF, THOMAS WHITIN, JACK KIEFER and 
others have intensively worked with the application of OR and Statistics on inventory 
problems. Its beginnings, however, go back to the mythical WILSON much earlier at 
about the turn of the century. At that time, the question of optimal inventory control 
strategies was a long disputed issue. This led to the development of the Theory of 
Dynamic Programming (RICHARD BELLMAN). 
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§2 ECONOMIC ORDER QUANTITY (EOQ) 

The standard case of the EOQ (or optimal lot size) problem is a trading firm which 
orders a good to store up stocks. Customer demand is satisfied by stocks on hand. We 
assume a constant rate of demand. Let 

A : Rate of Demand 

y : Stock on Hand 

The assumption of a constant rate of demand is a highly ideal one in wholesale and 
retail transactions. On the other hand, it occurs very frequently in raw material 
storage in continuous or batch type production. 

Cost Structure of the Inventory Model 

Ordering Cost: We assume a linear relationship for the ordering cost (Fig. 2.1). 


Order Cost 



k: 


fixed ordering cost. This covers administrative costs (e.g., $10-50; a business 
letter costs approximately $7), customer complaints, etc. 


INVENTORY CONTROL: MODELS AND METHODS 


3 


a: proportional ordering cost, e.g., transportation cost, cost for controlling 

incoming goods; in our model, it is mainly the buying price. 

Inventory Cost : It consists of interest costs, handling costs and rental costs for storage 
(even if one is the owner of the warehouse; in this case, the rental cost is an 
opportunity cost; the possibility of using the warehouse for other purposes is given up). 
Moreover, cost of wastage (in India, 1/4 of the grain harvest is eaten up by rats), 
depreciation and obsolescence can also arise. All these costs are summed up in the 
inventory cost. 

h : Inventory cost per unit item and unit time (Inventory cost rate) 

Shortage Cost : Shortages occur in case stocks are very low and, therefore, demand 
cannot be fully satisfied. These are charged penalty costs. 

g : Shortage cost per unit item and unit time 

z : Amount of shortage (deficit) 

G: Shortage cost 

Usually, shortage costs are assumed to be proportional to the amount of shortage, 


G = g z. 


Shortage costs can also be thought of as being independent of the size of the deficit z, 


G = g #z), 



0, for z = 0, 
1, for z > 0. 


6 is the so-called Kronecker delta. This second manner of calculating shortage costs 
was used, for example, by the US Navy. The inventory problem consisted of 
determining how many (replacement) parts a ship should carry to cover its needs 
during a voyage. Replenishment at sea happens infrequently. If the required number 
of spare parts of the same type is more than the number on board , then it is 
insignificant how many parts are lacking. High costs are incurred even if only a single 
part is missing. 
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Shortages can arise if the stock is not permanently recorded (periodic inspection), if the 
stock is ordered too late or when the ordered quantity is delivered late. 

The cost structure described here is very simple. A detailed discussion of differentiated 
cost considerations can be found, however, in business economics literature. 


The WILSON Lot Size Formula falsn ANDT.FR’s Formula or HARRIS’ formula) 

We consider the simple case of inventory with the above cost structure, constant 
demand rate and permanent stock control. Shortages are not allowed (see §10). The 
stock level is controlled by the following operational characteristic (Fig. 2.2) 


Stock y 



Figure 2.2: Operational characteristic of inventory control 

Since the delivery time is zero, it is obvious that an order only pays off when inventory 
reaches zero (t = tA The order quantity is 


D: 


Lot Size. 
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When the inventory level is again zero (t =^ 2 ), a second order is placed. Because A = 

constant, the system is stationary and there is no reason to choose another order 
quantity different from the first one. Since the situation at period t^ is the same as 

that in time t^ the optimal quantity in t^ must, therefore, also be optimal in t 2 * 

The demand rate A is pre-determined, i.e., independent of our actions. Hence, the 
area of optimization lies in the lot size. We need to determine the minimum cost order 
quantity. 

The objective function "Cost per cycle (t- — t^)" * s not suitable since minimizing 
these costs 

k + aD + h£*y-> Min 

la D 

average inventory-' I-cycle length t- — tj_j 


leads to the absurd result: optimal lot size D* = 0. 


A possible objective function is the average unit cost 


C: Average unit cost 


k + aD + h-£4 

(J =- -» Min . 

D D 


Another possible objective function is the cycle cost per unit time 


C: Cost during a cycle per unit time. 


k + aD + 

C =- — - Min . (2.1) 

D D 

I 
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Because of the proportionality C = AC and A = constant, it is irrelevant whether we 
use C or C. Both are convex in D. 



Figure 2.3: Convex objective function C or C 

* 

Therefore, we obtain the optimal lot size D by differentiating the objective function C 


min C(D) 
D 


<=> 





Equation (2.2) is called the WILSON Lot Size Formula or the HARRIS Formula. 
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This is a practical equation as shown by a short sensitivity analysis. The optimal lot 

* 

size D increases with an increasing demand rate A as well as with an increasing fixed 
ordering cost. 

Interval between Two Orders 
Let 

T: Interval between two orders 

From (2.2) one immediately derives 


2k 

JK 


(2.3) 


Average Range of Coverage 

An important reference value is the inventory-sales ratio. It tells something about the 
long-term efficiency of a stock control system. For an optimal ordering policy 


Average Inventory _ D 
Average Sales ~ TX 


TT 

WX * 


(2.4) 


Research has shown that, in spite of operations research, the average inventory 
holdings in the last two decades have increased. There are two reasons for this: 


1. Wages and salaries have increased sharply so that the rate k/h increased in 
spite of increasing interest costs (h increases) and the decrease in part of the 
fixed costs through EDP. 

2. Decentralization has increased the number of warehouses. Moreover, type 
variants (product variations) have increased such that the demand rate per 
variant and stock location have decreased which according to (2.4) leads to an 
increase in the average range of coverage. 


Returns to scale can also be seen from (2.4). The inventory-sales ratio of a company 
becomes more favorable with increasing sales volume. This, however, says something 
about the cost. The following section deals with cost considerations. 
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53 COSTS AND SENSITIVITY 
Costs 

The cost function C from equation (2.1) has among others a proportional ordering cost 
A - a. It can be seen that this term does not influence the optimal lot size. Over a long 
period of time, these ordering costs cannot be avoided and they can be considered to be 
fixed costs. Therefore these non-4nfluential terms are ignored in optimization. Let the 
new cost function without the ordering cost be 

c = C - Aa . 


For a period of length t 


k hD 

c = t + r 


(3.1) 


For an optimal ordering quantity D* 

c = fWXK . (3.2) 

As is to be expected, the cost of an order cycle per unit time increases with increasing 
sales volume. The increase, however, is sublinear: 




For the unit cost c = c/A per time unit 

«-r?- 


(3.3) 


It decreases, therefore, with increasing turnover. 
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This also shows the effect of increasing returns to scale (an advantage of bigger firms). 
The reason for this is INDIVISIBILITY, in our case, the indivisibility of the fixed 
order costs. Whether the order quantity is small or large, the fixed order costs, remain 
the same. 

In many large firms, however, inventory is decentralized. Because of this, the scaling 
effect is partly lost as shown in the following consideration. With m warehouses a 
single warehouse will have a demand rate of A/m. Let the total demand be A. With 
decentralization, the total cost per cycle is then 

m j 2khA/m = jm c , 

i.e., larger than centralization by a factor of jin. Decentralization has often been 
justified in the course of corporate history and, hence an energetic drive is needed to 
break out of the imposing structures and to reorganize the logistics. A similar stimulus 
in the form of high interest rates happened at the beginning of the eighties as one tried 
hard to rationalize the sales income of a corporation to free liquid resources. This 
resulted in the centralization of inventory in many firms. 

One should not ignore, however, the advantage of decentralization: improved customer 
service. This is not expressed in the above formulas (3.2), (3.3). For instance, they do 
not include the cost of transportation. 


Sensitivity 


C/C 

The partial derivative ^ gives information about how a change in each variable x 


affects the cost c. It is 


dc _ nnr dc _ faf 3c _ rxk 

~dX \TX ’ ; 
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These values, however, are dependent on the chosen units. One of the more important 
ratios is the elasticity e. It measures the relationship of the relative changes of two 
quantities 


dc 

f - c 
'c,A"^r • 

The elasticity can also be represented as a logarithmic derivative 

_ din c 
6 c,A ' din A * 


(3-4) 


The elasticity of c in relation to k and h are 

_ din c o i _ din c 
e c,k ■ cTfiTTc and e c,h ■ 3I5""E ’ 

respectively. 

With c = fik\h, one obtains 

TT“ 

6 c,A 6 c,k 6 c,h g 


The elasticity of cost per period in relation to A, k, h is always If, for example, the 

cost of k or h increases by p% then the total cost c per period increases by jj%. The 

same is true for the cost unit c. It is important to be sure about the sensitivity of c 
and c, because it can rarely be assumed in practice that k and h can be exactly 
determined. 

d c 

The cost sensitivity in relation to changes in lot sizes, is also interesting. It is not 

always possible to realize the minimum cost c. Some of the reassons are technical 
conditions (container size, truck or tank capacity), special packaging or an arbitrary 
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order cycle: week, month or quarter. Let us denote for a moment the optimal value 
with an asterix (*). With the help of the Taylor expansion about c*, we compute the 
cost difference c — c*. It is 

c = c(D) = jp + jp (compare (3.1)) 

dc _ kA^h 

_ 2kA 
<3r 


and, hence, 


c - 



0 + (D-D»)^2kA + 

1 (D*) J 


t 


dc 


the linear term disappears, since w 


D 



Ignoring higher terms we have 



How much it amounts to must be checked in each case. 

Example: 

Let k = $8, h = $0.01/day and unit, A = 1 unit/day. Then D* = |2Ak/h = 40 units. 
This lot size is enough for 40 days. The cost c* per day is c* = |2kAh = $0.40. 

The good, however, can only be obtained in minimum lot sizes of 50 units: D = 50. 
How much is the increase in cost per day? 

c - c* » Ak ( D - PT = $Jj = $0.0125. 

( D »)3 
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This is an overestimate. The actual cost difference where c is computed according to 
(3.1) amounts to $0.01. This means that Ac/c = 2.5% for every change AD/D of 
20% Average deviation from the optimal lot size is therefore not very noticeable. 
Reason: Ac is in the first approximation quadratic in AD. 


§4 RM-SYSTEMS (ABC ANALYSIS) 

The abbreviation RM stands for the latin "reductio ad maximum". In an RM-system, 
the goods are arranged according to importance. The importance of good i is 
considered according to its sales volume A-a-, measured in terms of its buying price 

(and not by its selling price for which we measure costs). In our model, the buying 
price is the proportional ordering cost a-. 


Earlier research has shown that the goods may be roughly classified into three classes 


Class Number of Items 


A 20% 

B 40% 

C 40% 


A pproximate ^A^ 

65% 

27% 

9% 


This type of classification is popularly known as ABC—Analysis . 

Is Aa actually the right criterion for a classification based on cost? The cost function is 
c = \ 2kAh. Accordingly, the criterion should be Akh. If, however, k is constant for all 
goods and h ~ a (interest cost!) then 


Aa ~ Akh. 

This gives the theoretical justification to use Aa as a measure for the cost which causes 
one to hold inventory. 
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The purpose of an ABC classification is to save inventory management costs. Only 
goods of class A (highest importance) are handled according to the best possible 
methods. Note: In this case, continuous stock control is often necessary! The simplest 
inventory models are used for goods in classes B and C . One normally relies on the 
rule of thumb. 


55 PRODUCT-MIX DECISION 

An ABC-Analysis can lead to the decision to straighten out the product mix and to 
discard certain articles, such as goods with high costs or low demand, the so-called 
slow-movers. 


Let 

p. : Selling price per unit of good i 

: Buying price per unit of good i. 


Then the profit per order period of length T is 


T(Pf — a^) — ^ . 


The Rate of Profit Gj = Revenue minus Cost per period 


k • + h • D • 75 - 

— a^) rp 


~ ^ 2k^A^h^ . 


The optimal product mix is given by all goods with a positive rate of profit. The 
cutoff Xj of demand, when G* = 0, is called the BREAKEVEN POINT. For Aj < X-, 

good i lies in the loss area, for A^ > Xj in the profit area. 
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2k i h i 

The break-even point is set at J- =-?y . 

1 (Pi ~ a A ) 2 


G i 



Figure 5.1: Break-even analysis 

A systematic product mix adjustment is often done, for example, in the case of 
booksellers. If the rate of sales falls below a critical value, the books will not be 
republished and the remaining stocks will be sold cheaply. To avoid the danger of 
selling out too early, it is important to know Aj exactly. 


§6 ESTIMATING THE RATE OF DEMAND 

Sales data in disaggregated form (not monthly, quarterly or yearly sales figures) are 
not always available. Therefore, we use previous order data to estimate the rate of 
demand. Let 

t Vi tVi 

t- : Interval between the last (i+l) in and i in order 

t Vi 

(Note: it is counted from the past, i.e., t* is the i in period in the past) 




INVENTORY CONTROL: MODELS AND METHODS 


15 


Dj: Order quantity at the last i** 1 order 

(Replenishment order!) 

L : Last i in auxiliary value for A; 1. = D./t^ 

(Note: The order D. is the substitute for the demand before the last ith order) 

In the lot size model it is assumed that A is constant. Hence, it has to be tested 
whether the observations actually support the assumption. A quick answer can be 
obtained from a visual inspection of the series {h}- ^ . 


li 



► t 


Figure 6.1: Time series of observations 1* 


If the observations 1- as is shown in Fig. 6.1 fluctuates around a long-run constant 

average, then the arithmetic mean from the n existing observations is a suitable 
estimator for the true A 



( 6 . 1 ) 


If one chooses only each of the last m observations, m is fixed, and one speaks of a 
moving average. 
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If the observations stretch over a longer period, shifts in the demand process, as a rule, 
will occur because of product mix changes, changes in brand loyalties, etc. It is then 
meaningful to give more weight to newer data than to old ones. By geometric 
weighting, one obtains for n -> oo: 



p < 1. 


p is the weight factor. 


( 6 . 2 ) 


The advantage of this weight is that it allows A to be easily computed recursively. It 
is 


A t+ i = (i-^lj + p\, t = 1,2,... (6.3) 

We substitute p with 1 - p and obtain the usual terms from time series theory 


A = pl l + (1 - p )Aj 


(6.4) 


1^ is the last observation, A^ the previous and A the new estimate for A. The 

equivalence between (6.2) and (6.3) is easily shown by successive solution of the 
recursion (6.3). 

The estimation procedure (6.3) or (6.4) is called a first-order exponential smoothing. 
The previous values are exponentially damped. Through this, the speed of adaptation 
to a sudden occurrence of a change in level is increased considerably as compared to 
the arithmetic mean method. It becomes specially clear if one formulates equation 
(6.1) recursively 

\+l = t + l 1 ! + t + l\ 


(6.5) 
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and let t be very large. The latest observation is taken as the new estimate value with 
a weight of l/(t + l). As time increases, this influence always becomes smaller. On the 
other hand, it remains constant with exponential smoothing. 

The theoretical basis of the first order exponential smoothing lies in the modelling of 
an adaptive expected behavior according to the formula 

E{A t+ 1 }-E{A t }=p(l!-E{A t }) , 

from which 

H\ t+1 }=pl 1 + (l-p)Z{\ t } (6.6) 

It describes the structure of the time series which fluctuates about a constant level, 
where this level is itself disrupted by random displacements. 

E {•} is the Expected Value Operator 



Figure 6.2: Time series with level displacements 


Exponential smoothing is an appropriate forecasting method with this type of time 
series. Time series theory, moreover, gives a general statement about the structures of 
time series for which this forecasting method is optimal. More discussions on this and 
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other refined variations of exponential smoothing can be found in BOX/JENKINS 
(1976) and MAKRIDAKIS/WHEELWRIGHT (1987). 

The usual values of p lie between 0.01 and 0.1. The choice of a suitable value p is 
again in itself a decision problem where the perception about the speed of adaptation 
comes into play. 


§7 PROFIT MAXIMIZATION 

We assume that a good is sold at a price p per unit, bought at price a and the other 
data as before. The objective is profit maximization. The average profit per unit time 
amounts obviously to 

pD — aD-k-h5-$- 

8 =-D/3-— * (7- 1 ) 

If the revenues and costs occurring within an inventory period are divided by the 
length of the period, then 

g = A(p - a) — ^ 1D 


g = A(p - a) - c (7.2) 

where c represents, as before, the average cost of inventory (compare §3) per unit time. 
Furthermore, 


Max g = A(p — a) + Max(— — It D) 
D D vt 


- A(p — a) — Min(-jj— + j D) . (7.3) 


The profit maximization problem is therefore identical to the cost minimization 
problem of the standard inventory theory except for the additive constant A(p-a). 
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§8 INVENTORY EVALUATION 

A firm has a license to engage in the warehousing business until time period T. Let the 
current stock be y at a given period t. How large is the commercial value of the firm? 
In other words, how does one evaluate the inventory y? 

The value of the firm is obviously a function of the stock level y as well as the 
remaining time T-t. It is described by 

v(y, T-t) 

During a short period At, it evolves as follows 

i 

v(y, T -1) = p AAt - hyAt + v(y - A At, T — t — At), y > 0 (8.1) 

since the current revenue is pAAt, the current costs are hyAt and stocks are reduced 
by -AAt. 

If y =0, then 

v(0, T -1) = —k - aD + v(D, T -1), y = 0 (8.2) 

applies because stocks must be replenished up to D and that causes the cost k + aD. 
The Taylor—Approximation for v(y - AAt, T-t - At) is 

v(y — AAt,T -1 - At) = v(y,T — t) — v • AAt — v^ ♦ At 

Substituting in (8.1) and dividing by At result in the partial differential equation for v 

Av + v = Ap - hy (8.3) 

y ^ 

with the boundary condition (8.2) and end value condition 
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v(y, 0) = 0 . (8.4) 

So that the end condition will hold, assume that 

y(T) = 0 

i.e., a final stock of zero is planned. 

It is not unreasonable to attempt to separate the value function into a purely 
time-dependent and a purely volume-dependent part 

v(y, T -1) = w(y) + g • (T -1) . (8.5) 

In addition, the time-dependent part is set proportional to the remaining time. The 
proportionality factor is to be interpreted as the rate of profit per unit time. Using the 
formula (8.5), the partial differential equation (8.3) yields 

Aw'(y)+g = Ap—hy. (8.6) 

Integrating from 0 to y gives 

w (y) - w(o) = (p - f) y - y 2 • (8.7) 

In particular, 

w(0) = 0 . 

Using the boundary condition (8.2), one obtains for y = D, 
w(D) — w(0) = k + aD = (p — ^)D—. 

The rate of profit is then determined to be 


g-A[p-a-jj-7FjD] . 


( 8 . 8 ) 
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The profit margin with rate A is 

p — a — jj — Tj-jj- D - p — a — c 


c is the unit cost per time (compare (3.3)). Substituting (8.8) and w(0) = 0 in (8.7) 
results into the value of inventory y 


w (y) - + n + y ~jx y2 • 


(8.9) 


The value of the business consists of the value of inventory (8.9) and the value of the 
remaining time g (T—t). The value of the stock is a quadratic, not a linear nor a 
proportional, function of the stock. It reaches its maximum at 


* ^ A(a + p + D) 


y = a| + d 


( 8 . 10 ) 


using the Wilson lot size formula for D. The value of the stock increases therefore with 
the stock in the whole range 0 < y < D. 

If one considers only the added value of stock m(y), i.e., the surplus above the buying 
price a, then according to (8.9) we have 

ra ( y ) = 4 + y ~?x y2 


f^kh h 2 

m ( y K -r ,y -^ y • 


(8.11) 


This added value assumes its maximum if 


dm ! A 12kh h A 

3y = 0 ’ ^^T-I y = 0 
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The optimal order quantity is therefore the one which maximizes the added value of 
inventory. 

The evaluation of inventory levels and its clear delineation from the time value of a 

_ ___ • • 

business enterprise are relevant economic problems (GRUBBSTROM). 


§9 QUANTITY DISCOUNT 

A modification of the standard model is necessary when considering quantity 
discounts. We differentiate two cases: 


Case L 

Quantity discount is only given for the parts of orders whose quantities exceed q^. 


Unit Price 

▲ 

a o- 

a r 



Figure 9.1: Discount scale 


- ► 

Order quantity 
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D > q Q is not an interesting case. We, therefore, assume D < q o first and ask what 

A. 

the optimal order quantity D is if more than is ordered. 


The average unit cost is 


C = 


k + q 0 a Q + (M 0 )a 1 + h 


D D 
1 ' I 


IT 


K h n n 
U + 0 D + a l ’ 


D >q 0 ? 


(9.1) 


where K = k + a (a - a i)- C is convex. We disregard for a moment the condition 
D > a and obtain using dC/dD ± 0 as minimizing lot size 


D = 


20 

T 


It is to be tested whether D > a . For D < a , C(D ), C(D) is the global minimum. 

A A £ 

For D > q o , C(D) and C(D ) are two relative minima and it remains to be determined 
which of the two is the global minimum. 

A comparison of unit costs results into (note: C = c + a) 


^ A o 


IKK 

~T 


+ a 


1 ' 


(9.2) 


Case D 


Case D 


Example: 

Let k = 8, h = 0.01, A = 1, = 100, D* = 40. 

How large must the quantity discount x = a — a. be so that it pays to take advantage 
of it? 
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The advantage is cancelled at 


0.4 + x = 


2(8 + lOOx) • 0.01 


^ x = 1.2 . 


Is it also true that D > a ? 


6 - - “o > % 


Case Z 

The lower price a, is chosen for the whole order quantity D as soon as D >V 

* 

Again, let D < a . Then it is clear that an order quantity D > q Q is not worthwhile. 
But maybe D = a ? The cost comparison is again the criterion 


2H 


+ > 
o 


Case D 


q“ + 2T ' q o + a l ' 
Case D = a 


(9.3) 


Example: 

With the same cost figures as before, the criterion (9.3) gives 
0.4 + X % jjjg + QjQi. 100 . 


Indifference is reached at x = 0.18. The discount jump is now considerably lower than 
the first case. 
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510 CXMJLECHVE OR SINGLE ORDER ? 

If a number of goods come from the same supplier, then a collective order can pay off 
under certain circumstances. Let 

kj, hj, : Fixed order costs, holding cost and rate of demand of good i 

k : Fixed order costs for collective order 

o 

c c *. Collective order costs per unit time 

c : Single order costs per unit time 

ij 

Sin gle Order: 

The cost per unit time (without proportional order costs) amounts on the average 
(compare (3.2)) to 

C S = l ^Wi • (10.1) 

1 

Collective Order: 

The reorder time must be the same for all goods. For an order cycle of length t, each 
lot requires D. = A^t. The cost of one cycle per unit time (again without proportional 

order costs) is (compare (3.1)) 

_ k o + \ 1 h i D i t 


t 


( 10 . 2 ) 
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Therefore, the objective function is 



We substitute the optimal cycle length T in (10.2) and obtain as minimum cost c for 
the collective order 
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Compare: c = c ? 

By cost comparison one can cancel the common factors such that the question becomes 



Case L 

k Q = ^ k-, i.e. with fixed order costs, a collective order does not have an advantage 
over a single order. The computation shows 



• » • 

1 i i 

The scalar product of both vectors K, A are on the left side and the product of these 
quantities on the right. Therefore, 



To avoid trivial cases, we can, as a rule, assume K, A > 0. Then the equality sign 

disappears and the single order is better than the collective order. Reason: In a single 

* 

order, the different individual optimal lots Dj are ordered. This is not possible in a 


collective order. 
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Case 2. 

k = k- = k. In this case one expects a cost advantage from the collective order. The 
computation confirms this. It is 



Squaring both sides gives the obvious statement 

2 ~ 

(2 Aj) > 2 Aj for A- > 0 . 

Case 3. 

kj = k + k = k + ^ r?.; i.e., the fixed order costs consist of a basic value k and a 

l 

product-dependent value 7?. . In this case, it could be c < c c , as well as, c s > c . 

§11 OPTIMAL STOCKING IN SERIAL PRODUCTION 

Lot sizes occur not only in commercial warehouses but also in production. We consider 
the simple case of production with continuous withdrawals from the finished parts 
inventory. 

An example is engine production in an automobile factory. The production program 
for the next half-year has scheduled the assembly of four-cylinder vehicles at a 
constant rate. These vehicles are manufactured daily on the assembly line. How large 
is the production lot of these engines? 
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i 


Let 


r 

[. 

; n: Production rate, n > A 

D: Lot size minus the continuous withdrawal during production of a lot (net lot 

size, stocking lot size) 

The stock movement has the following characteristic 



fj. - X X 


Figure 11.1: Stock control with replenishment through production 
The cost per unit time is 




Min . 
U 


( 11 . 1 ) 
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The optimal net lot size is 



4 



( 11 . 2 ) 


Instead of the rate A in (2.2), the harmonic mean of A and p — A now appears: 

The gross lot size (stocking lot size plus the continuous withdrawals during production) 
is 



The fixed ordering cost k in this case includes the set up costs and the startup costs. 


§12 STOCK-OUTS ALLOWED 

Until now we have been considering the inventory model under the condition: Stock y 
> 0. Now let us allow stock deficits. We differentiate two cases: 

a) LOST SALES CASE. Unsatisfied demand is lost. 

b) BACKORDER CASE. Unsatisfied demand is deferred until supply is again 

available. 

We consider the BACKORDER CASE. In practice, under conditions of certainty, this 
can only be done by a firm without competition, i.e., a monopolist (with the attitude 
"the public be damned"). It is, however, different with stochastic demand. In this 
case, one can not in all cases, even with the best intentions, guarantee a 100% delivery. 

As a rule, it will cost something if a shortage occurs. On the other hand, if these costs 
are not too high, stock deficits may also be cost effective. 


INVENTORY CONTROL: MODELS AND METHODS 


31 


Inventory y 



Figure 12.1: Operational characteristic of inventory 

control with shortages (Backorder case) 


The stock deficit M is evaluated with the proportional shortage cost g. 


The cost per period is 


k + h 


c = 


D D 

• j • j + g 

(D + M )/A 


M 

J 


M 

2 


Min 

D,M 


c(D,M) is convex. From 


dc . 

Wi 


h d 2 

_ I D 

(D+M) 2 (D+M) 2 


kA 


gM(D+M) 


_ 


+ 


(D+M) 


J 


i 0 


( 12 . 1 ) 




32 


CHAPTER 1: DETERMINISTIC INVENTORY MODELS 


sc._ k A _ - 1 1)2 i o 

W ' (D+M) 2 (D+M) 2 (D+M) 2 

=> Equality of Terms 

^ hD(D+M) = gM(D+M) 

it follows that 



( 12 . 2 ) 


The deficit, therefore, is always greater than zero regardless of how high the shortage 
cost is. 

Reason: If one hesitates to order beyond T- then shortage costs increase quadratically 

with time (t—Tj). For small At = t-T^ the cost curve is flat. The deferred demand 

Aq does not entail holding costs. If one stocks up so that Aq could also be satisfied 
then one must stock Aq for the whole period Tj. 


From (12.2), 



Hence, the cost function becomes 

c = }[k + h^T 1 +g ^T 2 ] = 
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c -»Min £4 
T 


dc _ k 
3T" ^ 


+ 




4 



(12.3) 


The optimal order is 



(12.4) 


Both these results can be reduced to the formulas (2.2) and (2.3) for g -» ®. 


§13 DISCRETE LOT SIZES 

In our previous inventory model the order quantity took on a real value. For small 
lots, however, the requirement of discreteness cannot be ignored. The holding cost per 
cycle is now 


D-l D 

1 v v ('n • \ h V ; h D(D + 1) 

J- h 1 Z ^ 2 1 • 

i=0 j=l 

1 

The period is j, during which the inventory remains at its present state, i.e., the time 
between two demands. 

The objective function c (cost of an order cycle per unit time, without proportional 
order cost) is 

c = ^ + ! (D+ 1) ^Min . 
u 1 D € N 

The condition for the minimum of the convex function c for whole numbers n is 


(13.1) 
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c . = Min {c } =» c , , > c , < c , . n . 
n jj 1 n —1 _ n* _ n* + l 

(see the following figure 13.1) 



n* 


Figure 13.1: Convex function c with discrete argument 
One considers the first difference 



For n* it changes from negative to positive. 


Example: 


For A = 1, h = 1, k = 1, the optimal lot size D according to the WILSON formula 
(2.1) would have been D = |2. 


Should one round up or down? It is better to proceed from D instead of D using the 
first differences. 
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We have c(0) > > 0. For the objective function (13.1) it is 


Aj = y + 1 - c(0) < 0; 


1 . 3 


A 2 = 2 + 2~~ 2 = () ; ^ the Minimum occurs in two places n = 1 and n = 2 


Arj — 7^+2 — 2 > 0 . 


* * 

Hence D = 1 and D =2 are both correct solutions. 


§14 CONSIDERATION OF SHELF SPACE IN INVENTORY 
Reserved Shelf Space 

In a multi-product inventory, we assume a specific shelf space is always designated for 
an item in order to have quick access to this particular item. Hence, the holding cost 
depends on the size of the reserved area. It is similar to the maximum stock quantity, 
D. 


Let 


hj : holding cost proportional to quantity 
\*2 ' holding cost proportional to shelf space 


The cost per unit time for an inventory cycle of length D/A is 


c = 


, , D Di n D 

k 1 h i r r h 2 D ' i 

B7* 


It is minimized when the lot size is 



"ZITA 

+ 2h2 


(14.1) 
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Hence, the effective inventory cost is the sum of the holding cost h, and twice the cost 
of the reserved storage space. 


Periodic Determination of Order Quantities 

Two products are stored. Shelf space is not reserved. The order pattern is the same 
for both products. The cycle length is T. One can shift the order points of the two 
products such that the maximum required total storage area is minimized. We denote 

t : Phase displacement of orders of product 2. 


Inventory 



T 


Figure 14.1: Periodic stock movement (single and composite) 

Total inventory manifests itself at two peaks. 

Peak 1: Order of product 2 
Peak 2: Order of product 1 
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The optimal phase displacement results from the condition 
Min {Max (Peak l|Peak 2}} . 

T 

The minimum is reached at the point where the two peaks have the same height 


A 2 T + Aj (T — r) = A^T + A 2 T 


(14.2) 



(14.3) 


We substitute r in (12.2) and obtain the maximum stock 



(14.4) 


It is symmetric in A and proportional to T/A ^ + A^ 


When the total rate A^ + A 2 is constant, the expression 


Ai + A ][ A 2 + 

A 1 + A 2 


assumes a minimum for Aj = X~. The proof is left to the reader. 
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515 BUDGET RESTRICTION 


In a multi-product warehouse, the individual goods compete for storage space. In a 
small storage room, one cannot expect that the total area will cover storage of the 
optimal lot size Dj from (14.1) for each good i. As a rule one must get by with a 

fraction of D-. This leads to an inventory model with capacity restrictions. Instead of 

limiting the storage area, the available capital could also be limited: either the current 
inventory asset or the current account, limited by the available credit line, in case all 
orders within an inventory cycle are paid for at the same time. 


Let 

b* : Space requirement or price per unit of product i 
b Q : Total inventory capacity or budget 
Xj: Lot size 

We minimize the costs £ c- of a cycle per period (compare (2.1)) 

i 

Min 

x l,..., x N 

with the constraint 
N 

l b i x i S b o 

i=l 



(15.1) 


(15.2) 


Using the method of Lagrange multipliers: 

The constraint (15.2) is added to the objective function using the Lagrange multiplier 
13. The expanded function is called the Lagrange function L 
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o h i 1 



L = - 

1 x. + !r x i 

♦ i • 

L i i J 


b o - Z b i x i. 
• 

1 


T 

since Min! 


> 0 


L is a concave function, therefore it is sufficient for an extremum 


dL ! rt 11 1 /TL rt 

35T-° : “2-2 /^i - 0, 


k • A • li • 


x. 

l 



(15.3) 


(15.4) 


For >3 = 0, i.e., if the budget restriction is never effective, (15.4) again becomes the old 
WILSON Formula (2.2). A comparison of both formulas shows that the budget 
restriction occurs in the form of higher inventory holding costs. If one uses only 
interest to represent holding costs, at a rate which is profitable given the available 
capital (e.g. the firm’s internal rate of return) and if b- is the capital investment per 

unit of product i, then the budget restriction leads to an increase in the nominal 
interest cost. 

If one interprets the constraint (15.2) as a space restriction, then it has the effect of an 
additional rental space of 2 fi per unit area. 

When does a reduction of order quantity for all goods (and, hence, the inventories) 
occur in the same proportion? It is sufficient that 

b. ~ h. , 


since with bj = oh-, a e R, i = 1,2,...,N, (15.4) results in 
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The optimization problem (15.1), (15.2) can also be formulated as a non-linear 
program 



such that 




2) x. > 0, 


i = 1,2.. 



Since the optimization problem is now restricted to x- > 0, a boundary extreme could 
occur, but that is never the case in the above objective function. 

Determination of 0: ^ > 0 applies. Since all > 0, ^ = 0. 

From this follows 



The bigger b is, i.e., the weaker the constraint equation, the smaller 0 becomes. 
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P 



Figure 15.1: Dependence of an additional cost /? on the budget 
At b > E, the budget is no longer fully utilized. 


Example: Inventory Costs 


h i = ra i 

b.= a i 

I a i x i * b o 


Interest 

Capital budget costs 
Proportional order costs 
Budget constraint 


Then 



0 is the shortage cost which is added to the capital. 
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From the last equation it is obvious that the relationship of each a^xj/a-x- to each of 

J J 

the individual lot sizes does not change if one increases the budget b . 


§16 KNOWN BUT VARYING DEMAND 

Until now the demand rate was considered as constant over an infinite time horizon. 
For commercial warehouses this occurs only in exceptional cases. There is a similar 
situation with manufacturing inventories, for example, where purchased parts are kept 
in stock to be used as components in serial production. A constant demand rate is also 
rare in that case. Hence, we drop this restrictive assumption. Let 

Aj: Demands during the periods i = l,2,...n 
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Demand is, therefore, known up to period n. 

Example: Automobile Factory 

Depending on customer’s preferences, a vehicle can be equipped with a wood grain 
steering wheel. The daily requirements for the steering wheel are determined from the 
bill of materials for the vehicles prepared from the preceding quarter. These are 
ordered from a supplier. What is the optimal lot size? 

One can formulate this problem as an integer programming problem with a planning 
horizon of n. However, this would not be a proper use of the model since, after a 
fraction of the planning horizon has passed, new information about the requirements 
for the period after n already exists. Strictly speaking, we are dealing with a rolling 
planning horizon. It would be a fruitless effort to want to solve this problem exactly. 

There are two possibilities for this model: 

a) Either one knows how the problem behaves in the future, i.e, one can, at least, 
give probabilities for future demands. Such models are treated later. 

b) Or, one considers it as an infinite horizon problem. One asks how long an order 
will suffice. The time period T is chosen such that the average costs are 
minimized. This procedure is repeated each time the stock sinks to zero. 
Surely, this method is not optimal but is more practical than the exact method 
because one need not wait for a new decision until the whole planning horizon 
has passed. In the literature, this method is known as the SILVER-MEAL 
heuristic. (SILVER & MEAL (1973)). 

The objective function is to ’’minimize the average cost of the first order cycle” 


c(T) -> M i n 
T 


(16.1) 


c(T) = £ + ^{A t T + A t _ 1 (T - 1) + ... + Aj] 


(16.2) 
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Note: The quantity A- is stored for i periods. Minimization occurs in the discrete 
variable T. The optimality conditions are 


c(T) - c(T - 1) < 0 ; 
c(T + 1) - c(T) > 0 . 


(16.3) 


These are, however, only necessary but not sufficient, since (16.2) is not convex. 

T 

ll V 

The convexity is ensured if y ) iA- is monotone, i.e., for 

i = l 

T 

T(T + l)A-p + ^ > ^ iA- ; T = 1,2, ..., n. 

i=l 


Example: 

Let k 12, h 

= 0.1, n 

=8 






T 

1 

2 

3 

4 

5 

6 

7 

8 

A'T' 

5 

3 

6 

2 

4 

w- 

4 

7 

k/T 

12 

6 

4 

3 

2.4 

2 

1.7 

1.5 

I iA i 

5 

11 

29 

37 

57 

75 

103 

159 

T U i 

0.5 

0.55 

0.97 

0.93 

1.14 

1.25 

1.47 

1.99 

c(T) 

12.5 

6.55 

4.97 

3.93 

3.54 

3.25 

3.17 

T = 7 

3.49 

for h = 1, two local minima are obtained! 






c(T) 

1 17 

11 .5 

13.7 

12.3 

13.8 

14.5 

16.4 

21.4 


T =2 
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We now consider the problem in continuous time 

T 

c(T) = i [k + h / tA(t)dt] . 

o 

Minimizing with respect to T: 



T 2 A(T) - JtA(t)dt 
k , v o 

+ t £2 - 



£ = h[TA(T) ) tA(t)dt] 

1 0 



Average cost with respect Limit cost of 
to time for a sufficient the order cycle 

order 


This equation illustrates the economic principle in cost minimization: 

The average cost must be equal to the marginal cost. 

Other methods, for example, are the method of the smoothed economic lot size 
(minimization of the unit cost of a lot) and the "Part—Period—Method" of 
DeMATTEIS and MENDOZA (1968) (Minimization criterion: the order and holding 
costs are equal for optimal lot sizes). The latter, as a rule, achieves better results 
(compare OHSE (1970)). 

Studies by KNOLMAYER (1985) have shown that the heuristic by SILVER and 
MEAL gives the best results for rates of demand which fluctuate about a constant 
average. 
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Two modifications of this method stem from SILVER and MILTENBURG (1984). 
One was developed for the case of monotonically decreasing demand and the other for 
the case of sporadic demand. 

Of course, there are some situations in which it is meaningful to solve the lot size 
problem exactly; for example, if a branch is to be closed in the near future. The 
production program during this phase-out is fixed and with it the demand for raw 
materials in the different periods. A rolling plan is not applied here. Dynamic 
optimization is a possible method of solution for this problem formulated by 
WAGNER and WHITIN (1958). 

In general, the algorithm of WAGNER and WHITIN performs poorly compared to the 
SILVER-^fEAL heuristic (BLACKBURN and MILLEN (1980)). However, CHAND 
(1982) has modified it such that, according to his own statement, it performs much 
better than the SILVER—MEAL heuristic. 


§17 FIXED DELIVERY PERIOD r 

the time of 
Obviously, 
The stock 

y = A • r 

If the delivery period r is longer than the duration of a cycle 

T -jD' 


If the delivery period r is not zero but positive, constant and known, then 
ordering and the periods of maximum stock levels should be differentiated, 
the order must be placed r time periods before the stock becomes empty, 
level is at its optimal order point when 


then orders occur in each time period and, in certain periods, more than once. If that 
is not allowed, one must always order the quantity Ar at the moment when the last 
order has been delivered. Because of this, costs increase compared to the case in which 
frequent orders are made. In general, firms seek to avoid early deliveries in the same 
way they attempt to avoid late deliveries by imposing contract penalties, etc. The 
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punctuality and reliability of suppliers in Japan are cited by US automobile companies 
as production advantages of their Japanese competitors. 


Weak Demand 

For a product which is rarely demanded, the question is asked whether one should 
stock this product at all. 

a) Do not hold in stock: A penalty cost per turnover is incurred in the amount of g r 

b) Hold in stock: A holding cost per turnover is incurred in the amount of h/A. 

Fixed order costs are ignored for comparison. The product will not be stocked for gr < 
h/A, i.e., 



(17.1) 


A similar situation arises in the mail-order business. Each customer placing an order 
is seen as a marketing outlet. In this extreme form of decentralization, the rate of sales 
per outlet and per product is low. Savings in terms of inventory holding costs are 
partly returned as a price advantage to the customer. 

This model also occurs in the pharmaceutical industry. The demand for a given 
medicine at an outlet (pharmacy) is low and the delivery time is very short (a few 
hours). For this reason, pharmacies stock only a basic assortment of medicines. 


§18 SAFETY STOCK WITH STOCHASTIC DELIVERY TIME 
(including JUST IN-TIME PRODUCTION) 

We continue the discussion of the case of a constant and known demand. Let the 
delivery time r now be a random variable with an expected value n T . If one specifies 

the order point s^ = A/ip (this is already the demand during the expected delivery 

time // ), then, in the case of a symmetrical delivery time distribution, one would have 
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as many stocks as shortages immediately before an order arrives. This is only optimal 
if the inventory cost rate, h, is equally as large as the shortage cost rate, g. By 
increasing the order point to S 2 > s^, the risk of shortage is reduced. The amount 

s 2 —5i is the safety stock . Its purpose is to cover any deviations in expected delivery 

times. 


y 



Figure 18.1: Stock movement with and without safety stock 

We assume that a delivery time n was agreed upon with a supplier. Because of 

unforeseen circumstances, delays (e.g., production bottleneck, slow customs clearance 
procedures) or early deliveries (e.g. as a result of route planning of deliveries) may 
occur. The processing of orders during receipt of goods and during quality control may 
also cause fluctuations. These deviations are unexpected and are, therefore, considered 
in the model as an occurrence of a random error e . 

T 



INVENTORY CONTROL: MODELS AND METHODS 


49 


: Random deviation from the agreed-upon delivery schedule, random quantity 

with distribution function P(e r ) 

The total delivery time r 

t = n T + e T (18.1) 

is then a random quantity. 

The problem of uncertain delivery time is often solved heuristically. One determines a 
percentage service level 

SERVICE LEVEL 0 - |{ * 100, 

e.g. 0 = 97% The probability of shortage is 

Prob(y <0) = 1-0/100. (18.2) 

The safety stock S 2 — should be large enough to allow the firm to achieve the given 
service level and the desired shortage probability. 

If one knows the distribution function P(e r ) of the error variable, then the safety stock 
may be derived from (18.2). The value s^ = is known. Shortages occur if the 
delivery time is longer than the duration of the safety stock 

s 2 

+ C T > J~ ’ 
i.e. 

s 2 

f > V- — Li 

T A 
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For a fixed s ^ the probability of the occurrence of shortages is 


Prob(y < 0) = Prob(e r > 


(18.3) 


' 1-P (j 

We choose S 2 such that the probability of shortage takes a desired valued, i.e., it 
satisfies (18.2). Hence it must follow 

1 — 0/100 = 1 — P (ip — n r ), 


0/100 - P(^— p, T ). 


(18.4) 


One obtains the order point S 2 from the 0-percentile of the distribution function 
P(£ r ). If, for example, the error variable is normally distributed N(0,a ), then from 
(18.4) 

0/100 = N(^p - h t ) 

and by standardizing to the N(0,1) normal distribution 


So T 




Let be the 0-percentile of the N(0,1) distribution. For S 2 , it results into the 


condition 
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4 


Srt — 6 nti + A 

l p/t T UT 



(18.5) 


Since A n r = we obtain for the safety level SB the term 



(18.6) 


Inventory y 



VX 


Figure 18.2: Safety stock level and shortage probability with order point s^ 

The density function is superimposed on the graph. 
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The corresponding percentages with (0,1)—normally distributed delivery times for some 
service levels are given in Table 18.1. 


Table 18.1 


Service 

Level /? 


90 

1.2816 

95 

1.6449 

96 

1.7507 

97 

1.8808 

98 

2.0537 

99 

2.3263 

99.5 

2.5758 

99.6 

2.6521 

99.7 

2.7478 

99.8 

2.8782 

99.9 

3.0902 


Delivery Delays Only 

Often the firm is concerned only with late deliveries and is indifferent toward early 
deliveries. Time is then 


T = H + c T , 

where // is the agreed-upon delivery time and t T is a random variable which can take 

on only nonnegative values with a density distribution which, for example, has the 
following characteristic 
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Density 

A 


11 1 “ 1 -i i ■ ‘' ; , 

i “ •<’ ' , ‘ ,1!- , " ’ \ 


_ i" :• . , \ 

‘ ' . 




• Pv 


1 " t’J, c • ■’ 1 > . . . 

> ,H - -. ,‘> * . ,!• \ 








U<t»: 


.-i'.'it n 


6-Percentile 




■i 7 - 




Figure 18.3: Density distribution of a random delivery delay e r 


If one wants to ensure himself against /3% of all delays, then one has to store a safety 
stock which is enough for the period until the /2-percentile, i.e., until c^% 



(18.7) 


Note: In comparison to (18.6), the distribution of the delivery delay here is not 
normalized. Hence the term c r does appear as a multiplier. 


Safety Stock in Just-in—Time Production 

We now consider as products component parts which are manufactured by a supplier 
and put together on an assembly line. If economic or space limitation reasons dictate, 
the component parts may be produced on the day they are needed and delivered to the 
assembly line "just-in-time". The time planning runs, in principle, according to the 
following schema: 
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Supplier: Material procurement Finite planning Control 

Rough Planning 

-> Time 

Manufacturer: Announcement Report of weekly Report of daily 

of production requirements requirements; exact 

program determination of variants. 


The last important phase requires close communication and strong discipline in 
carrying-out production and transportation 


Work Production and Transportation 

preparat ion Cont r ol 


Time 


Final determination of Assembly line Installation 

all part and production preparation 

schedules 


Nevertheless, delays in deliveries may occur. These must be buffered with a safety 
stock at the assembly line. It is calculated as before according to the formula (18.7). 
Note, however, that contrary to inventory models, the schedules are not 
stock-dependent but are directed by the production flow of the manufacturer. Hence, 
one needs a time buffer instead of a quantity buffer. Instead of the safety stock, an 
order is placed earlier within the interval Quite naturally, the consequence of 

this early order is also an early delivery and, hence, a stock at the assembly line. 


jxfTiYr-^' 
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A number of models implemented in practice works with a given service level which is 
held to be cost minimizing. In OR Literature, one also finds a number of complicated 
stochastic service level models, e.g., by H. SCHNEIDER, CH. SCHNEEWEISS, 
J. ALSCHER and M. KUHN (see ALSCHER, KUHN and SCHNEEWEISS (1986) and 
the references given there). 

The service level must, however, be chosen carefully because it influences the expected 
total costs. Fixing the service level at a specific value must strictly be a result of 
optimization (what is the optimal service level?). In fact, the service level in an 
inventory problem, in which the costs are to be minimized, depends on how expensive 
the inventory shortages are. One shoudl,therefore, work directly with the shortage 
costs instead of the given service level. This does restrict generality since the service 
level and shortage costs are equivalent. A given shortage cost is assigned a specific 
service level in optimal lot sizes and vice versa. 

In the following chapters, stochastic inventory models are discussed in which shortage 
costs are considered. Inventory managers sometimes evade the issue of shortage costs 
either by using rough approximations or by ignoring them altogether and setting 
arbitrary service levels. The first priority of inventory managers in this instance 
should be to attempt to determine shortage costs accurately. At times, however, it is 
not possible to determine exactly the shortage costs. In these cases, the determination 
of the service level is also as arbitrary as the arbitrary determination of the shortage 
costs. In these situations, it is more meaningful to vary the service level or shortage 
costs as a simulation parameter and to specify a concrete value as feasible from the 
results. 



CHAPTER 2* 

THE WILSON MODEL WITH POISSON DEMAND 

§19 POISSON PROCESS 

The POISSON PROCESS will be introduced as a preparation for the discussion of 
inventory models with stochastic demand. It goes back to BORTKIEWITZ who 
studied the number of officers in the Prussian Army who were killed by horse kicks. 

We derive the Poisson Process using an example of a demand for spare parts. Demand 
always occurs when a defective part is to be replaced. 

A single Part: 

Probability of a defect during At : p* At; 

Probability of no defects during At: 1 — p • At. 

n Parts: 

Under the assumptions: 

a) the parts do not influence each other; and, 

b) the probability of p defects is the same for each part 

the probability of u defects in At is: (^)(pAt) u (l - pAt) 11 "”* 1 . 

Very many parts: 

Assumption: 

n -> oo, p -> 0, but np is finite: n-p = A = constant 
Probability of no defects in At: p Q (At) 

p 0 (At) = lim(“)(pAt)°(l-pAt) n 

n->oo 

= 1 im (1 - jL At) n = e -AAt . 

n->oo 
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Observation of defects over time: 

Assumptions: 

a) infinite no. of parts 

b) the probability that any part is defective during the period At is AAt 

c) the probability that many parts are defective in At (At < < e), is small and 
can be ignored, 

d) individual defects are independent of each other. 

We already know that 


P 0 (t) = e~ At 


(19.1) 


and now compute the probability of u losses in [0, t], u = 1,2,...: 


H= 




At 


t - r 


-/S. 


0 


No Defects 


No Defects 


Defect 

Position of At at anytime [0, t] 



p 1 (‘) 

P2< 1 ) 

= } e~ Ar A- A(t - r)e~ A ^- r )dr = ^ e~ At 
o z 


= } e -Ar Ae -A ( t_r W = } Ae" At dr = Ate -At 

O 0 

t 

= /PoMApjCt - r)dr = 


in general, u defects in [0,t]: 
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(19.2) 


POISSON PROCESS 

Hence, the occurrence of a demand according to a Poisson process (the so-called 
Poisson demand) is described as the demand for each single component where the time 
between two successive demands is exponentially distributed (19.1). As before, we 
have 

A : Constant demand rate 


Important Property of the Poisson Demand 

Constant demand means that the probability of an instantaneous demand is 
independent of the time elapsed since the occurrence of the last demand. This " lack of 
memory" is unique for continuous time considerations. There is no other continuous 
demand type with this characteristic. Lack of memory means 

P 0 (t + At) = P 0 (t)p 0 (At). (19.3) 

This is the defining functional equation of the exponential function. 


A pplication: 

It can be seen clearly that the assumption of a Poisson demand is relevant when there 
is a very large customer base whereby each customer orders sporadically and 
independent of the others. The so-called " lumpy demand" falls under this situation. 
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Poisson Distribution: 

If demand is described by the Poisson process, then the demand occurring at period 
t = 1 is Poisson distributed 



, U 6 W Q . 


(19.4) 


The Poisson distribution belongs to the so-called FAMILY OF BINOMIAL 
DISTRIBUTIONS. This family includes 


the Bernoulli Distribution: 
the Binomial Distribution: 
the Negative Binomial Distribution: 
the Geometric Distribution: 


p 0 = i - p; Pi = p; o < p < i; 

Pu^Op^-p)™ : 

Pu^Cx-p^-p) 11 : 

p u =( i -p)p u • 


2 

To conveniently compute the expected value /z and variance a , we use the method of 
the generating function. 


Method of the Generating Function 

Let pp p^ ••• be a discrete distribution of an integer-valued random variable u. The 
function 


C( X ) = l p u x u 
u =0 


is called the GENERATING FUNCTION. It is 


G '(x)l x=1 up u =m x =// 


Expected Value 
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G'( x )l x=1 = li 


(19.5) 


G,/ Wlx=l = I u2 Pu-I U Pu = D1 2- m l 


2 2 2 
Since a - iru — m,, one obtains the variance a 


= G"(x)| j + G'(x)| t - [G'(x)| x=1 ] 2 


(19.6) 


For the Poisson Distribution, 


G(x) = y^e- A x u = e J ( x -') 

u 11 • 


u=o 


For the Poisson Process, 


G(x,t) = yl^e-V = e At ^ x " 1 ) 

4J U • 


11=0 


For the Binomial Distribution, 


= A a - \ 


\i. - At 


't = At 


9 

G(x) = (1 — p + px) n A [i = np a = np(l - p) 


For the Negative Binomial Distribution 
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For the Geometric Distribution 



- p 

<r - p , 

** ■ 

(1 -p) 2 


The generating function has the following characteristics: 

a) A distribution <p corresponds 1:1 to its generating function 


b) The generating function of the convolution of distributions <p 1, <p2 of two 
independent random variables is the product of the generating functions of each 
of the distributions 


<P 2 




(x) -G 



c) Let v = f(w), w is a random variable with distribution <p , then the generating 

w 

function of the distribution ip is the function G y (x) 

« - O (f(x)) 

*w 


With these characteristics, the relationships within the family of binomial distributions 
can be shown. We will use the last property to develop the first two moments of the 
so-ealled stuttering Poisson process. 


Compound Poisson Process 

An event described by the Poisson process means " demand for one part". Under a 
compound Poisson process, many parts (or none) may be demanded per event. The 
time between two events is, as before, exponentially distributed. The number of 
required parts per event also obeys a distribution. 


Example: 

A beer salesman sells door-to—door by talking with prospective clients. The duration 
of each sales talk is exponentially distributed. We consider the end of a sales visit as an 
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event. As a result, u cases of beer may be sold, u = 0, 1, 2, .... Let 

w u : Probability that each sales talk will result in u cases of beer being sold. 

Case 1: 

Let w u be Bemouilli distributed, i.e., 

Wj : Talk is successful, sale of a case of beer 

Wq : Talk is a failure, no sale 

w 1 = 1 — w =: w 
1 o 

w u (n): Probability of u successes out of n sales talks 

wW = (1 — w)w^ 

u v ' u u-1 

v< n ) = ( ;)w u (l-w)« Binomial Distribution 




e (l — w)At 
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, u elN o . (19.7) 

These probabilities describe a compound POISSON PROCESS (with Bernoulli 
distribution). 

The generating function is 



G ( x,t )p.Bern. _G P.( G Bern. 

= e At [ G Bern.W- 1 J 


- [1 — w + wx — 1] 

_ 0 wAt [x - 1] 



The corresponding distribution is the compound POISSON DISTRIBUTION (with 
Bernoulli's distribution) 



Case 2: 

Let w u be geometrically distributed: 


w = (1 — w) w 11 , 0 < w < 1. 

The compound process is called the stuttering Poisson process , w i.e, we compute 

the probability that u cases of beer will be sold in n sales talks over the generating 
function 



1 — w 

geom. ” 1 — wx 
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The generating function of the n-fold convolution of the geometric distribution is 

w*W] D = [f^] n • 

If one compares this expression with the negative binomial distribution, one sees that 
the n-fold convolution of the geometric distribution results in a negative binomial 
distribution with the power -n 


\ ID x—n 1 , nz , n(n + l)z‘ 

NB: (1 —z) = 1+^ + -^— k-y- 1 - ■ 


(n + u - 1) !z u 
• + (n -'i Tur + 


■ i <‘tv • 


U=0 


Hence, 


x— n v /n+u— lx u u 

~ wx ) = l ( u ) w x ’ 


u=o 


and, therefore 


00 



STUTTERING POISSON PROCESS 
(with Geometric Disrtibution) 


(19.8) 
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The generating function is 


G ( x > t ) S t.P. 

geom. 


=e Awt ^T 


1 


wx 




v(1 + w)At 
(1 - w) 


2 2 
In this case, a > \i which is different from the pure Poisson process (where a = //). 

2 

If the conditions point to a Poisson distribution but the empirical data indicate that o 
> /i, then this case may involve a mixed Poisson distribution. 


Mixed Poisson Distribution 

With the mixed Poisson distribution, the rate A is itself distributed with 
yp(A) dA: (generalized) probability density function of A. 


Then 



, U6W 0 . 


(19.9) 


MIXED POISSON DISTRIBUTION 


The generating function is 


G(x) 


00 


gem.P. 


= I e 


A(x- 1) 


G(A) dA 


0 


G(A) is the generating function of the distribution of A. 
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/i = / A6(A) dA 
o 

(P* — f AG^(A) dA < //, since G^ < G for 0 < G < 1. 

520 GENERAL REMARKS ON CHANCE 

A stroke of chance or coincidence is often associated with superstition or a flight into 
irrationality. Scientifically, " chance” is explained through the term probability. 
CHEVALIER DE MERE provided the impulse to deeper mathematical considerations 
when he asked PASCAL in a letter to make some statements about the prospects of 
winning in a prematurely ended card game. (RENYI (1969)). 

A new situation arises in inventory: the earlier cost-profit decision criteria for the 
choice of the best course of action are now dependent on chance. In retrospect, it is 
then possible that the decision of an idiot would prove to be the best while that of an 
expert would be the worst. This, however, is an exception. In the long run, a valid 
decision rule will always prove to be better (according to the law of large numbers). 
"Only the competent have luck in the long run." Here lies the essential justification of 
OR in situations under risk. 

For a process subject to chance, the extent of a desired result is typically determined 
by a specific course of action, i.e., a selected action, which controls the development of 
the process, is also subject to chance. Its probability distribution is known. 

The choice of an action may be traced back to the corresponding probability 
distribution of the result. Hence, a criterion is needed for the choice of a distribution. 
Originally, one used the EXPECTED VALUE CRITERION. 

Let 

P : the distribution corresponding to action a of result x 

a 
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Then, by the expected value criterion 

is better than Ep {x} > Ep {x} . 

a l ®2 

An objection against this criterion is the so-called PETERSBURG PARADOX: 

One tosses a coin a number of times until a "head" appears. When this happens at the 

1 1 m 

n tn toss, one wins x = 2 n from the bank. The distribution <p of winning with the 
action a^ = "play the Game" with prob(Tail) = prob(Head) = 0.5 has the infinite 

expected value 


E P « = I (£) ’ 2 n = 1 + 1 + ... . 
a l n=l 

The action 2 ^ = "do not play" does not give any return. According to the expected 

value criterion, a player must also be prepared to pay a large entry fee to the bank 
before he can play. In reality, no one is inclined to pay such a large amount. 

The above paradox can be solved with the help of the criterion of expected utility 
which was introduced by and named after DANIEL BERNOULLI (1738). One 
measures the utility of money instead of the monetary payoff. 
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Utility 



u(x ) : Utility of the result x 

A utility function is rarely linear. In general, it is upper bounded. As long as it 
remains unbounded, new paradoxes can be constructed. In general economic terms, 
the utility function is assumed to be concave. 

The expected utility is 

E p (u(x)} = Ju(x)dP (x), 
a 

and according to the utility criterion 


a, is better than a** Ep (u(x)} > Ep (u(x)} . 


1 
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How does one know that the decisions based on the BERNOUILLI PRINCIPLE are 
good? One first tries to clearly derive, as much as possible, some enlightening 
consequences from this principle. As one finds more plausible consequences, the more 
plausible the Bernoulli principle becomes. 

JOHN V. NEUMANN and OSKAR MORGENSTERN established an axiomatic 
system for rational behavior which implies the Bernoulli principle, the so-called 
"Utility Axioms". These axioms are plausible in themselves, although there are some 
doubts (ALLAIS). 

A detailed discussion of decision theory under risk and uncertainty is found in 
CH. SCHNEEWEISS (1967) and DE GROOT (1970). 

§21 INTEREST, CONTINUOUS INTEREST, PRESENT VALUE 
Interest 

Why is there interest? Obviously, a unit of money has more value today than one year 
later, even if the inflation rate is zero. The reason lies in the fact that the use of 
money brings with it a return, which can be paid as interest i. Discounting is used 
when interest goes back in time. 

Now in a Year 

, Interest , , . 

1 -> 1 + l 

„. 1 Discount i 

p -m ( - 1 

i Interest; 

1 + i: Interest Factor; 

l/(l+i): Discount Factor p\ used to compute the present value of a future return. 
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Continuous Interest Payment 

The annual interest rate is usually i. For semi-annual payments, capital grows by a 
factor of (l+i/2)^, by n payments per year (l+i/n) n . Taking limits, one obtains the 
continuous growth rate of capital 


Continuous interest payment: 


1 im (1 + j) = e* . 

n-> oo 


Since 


.2 

e 1 = 1 + i + + ... > 1 + i 

continuous compound interest is larger than discrete simple interest. An annual 
interest of i corresponds to an interest intensity r < i. 


Interest Intensity r: 


1 + i = e r 


r = ln(l + i) = i 



( 21 . 1 ) 


The discount factor p resulting from (21.1) is 



( 21 . 2 ) 


Present Value 

We now consider a flow of future payments which occur at equidistant time periods 
(year-end) t = 0,1,2,3,... For the decision problem it is necessary to evaluate the flow 
of payments relative to a specific time period. One usually chooses the last period or 
(often) the present time period. In the latter case, the so-called present value (instead 
of the final value) is computed. The present value is preferable for decisions in the 
current period. 
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Let 

: Flow of payment, t = 0,1,...T 

G : Present value. It is defined as 


T T 



Aside from the volume of money, the average payment 
C: Average value 

is an important indicator of payment flow. C is the average payment per unit time 

T 

c = rrr 1 h ■ 

t=0 

There is a relationship between C and G. Thus, for large T and p very near 1, we have 
z Q + pz ] _ + ... + p T z t « C(1 + p + ... ++ p T ). 



For stationary models with an infinite planning horizon, i.e. all the z t ’s are identical, 
the flow of payment is infinitely long. Then 

(21.3) 
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522 INVENTORY WITH POISSON DEMAND AND IMMEDIATE DELIVERY 

One of the simplest stochastic inventory models is inventory with Poisson demand and 
immediate delivery. This model is interesting because it is handled with methods 
which are different from the previous models. Since a Poisson process is assumed for 
demand, the time since the occurrence of the last demand does not play a role. 

Inventory is considered as a business which yields profit. The present value of future 
profits is dependent on the starting inventory y: G = G(y). We formulate G(y) 
recursive in time in which we divide the future into two parts: a small time period At 
lying immediately ahead and the rest of the periods. Because of the Poisson demand, 
it is not necessary to use t as an explicit argument of G. For all y we have 

G(y) = -hyAt + (1 - AAt)G(y)e + 



+ AAt[b + Max {-k - a(D - y + 1) + G(D)e~ rAt | G(y-l)e rAt }] (22.1) 

S -X-' t s ---' s ---' s ---' 

4) 5) 6) 7) 8) 

1) Inventory costs during At 

2) Probabity that no demand occurred in At 

3) Present value after At 

4) Probability that demand occurred in At 

5) Sales Revenue 

6) Order Costs 

7) Present value after At, if an order occurred in At 

8) Present value after At, if no order occurred in At 

Note: With a Poisson demand, only one unit is demanded per event. 

The recursion (22.1) formulates the ” Principle of Optimality" of dynamic 
programming (BELLMAN’S Principle of Optimality; BELLMAN (1957), 
BECKMANN (1968)). 
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This equation is explained as follows: 

The present inventory is y. The present value G(y) is the value of all future costs and 
revenues based on the present. It is written on the right side of (22.1) but is now 
separated into the time span At and the rest of the periods. At first, inventory costs 
accumulate in a small time span At (Term 1). At the end of At time periods we add 
the profits from the rest of the periods to the accruing inventory costs. This value 
depends on whether or not an order occurs after At and, in case demand occurs, 
whether or not one orders. 

Case 1: No demand occurs: Terms 2) and 3) 

Case 2: Demand occurs with probability AAt 

Case 2a: Stock up to D: Terms 6) and 7) 

Case 2b: do not order: Term 8) 

In any case the future costs and profits of the remaining terms must also be discounted 

r/\| 

into the present; hence, the factor e . The inventory cost hy is not discounted 
within the time period At (one may interpret the inventory cost so that the 
discounting, i.e., the interest cost, is already contained in h). 

The solution of the functional equation (22.1) determines a corresponding optimal 
decision for each y, i.e. the action which yields the maximum on the right hand side of 
(22.1). Since the functional equation is solved for all valid y, one obtains for each y an 
optimal course of action and hence, as a whole, a decision rule or policy. 

In the given case the decision rule is already prestructured. D is no longer the lot size, 
but the amount which must be kept in stock. On the other hand, an order should be 
placed only when the inventory sinks to zero. If one places an order at y Q >0, then one 

would hold a constant ’’floor stock” y Q which would never be used. Because of this, D 

is again the lot size and from (22.1) 

G(y) = —hyAt + (1 — AAt) G(y)e + AAt (b + G(y—l)e r ^),y>l, (22.2) 

G(l) = -hAt + (1 - AAt) G(l)e _rAt + Mt(b-k-aD + G(D)e _rAt ). (22.3) 


(22.3) is the boundary condition to the differential equation (22.2). 
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If one approximates e -r 1 by 1 — rAt, one obtains 

G(y) = —hyAt + G(y) — G(y) (A + r)At + AAtb + AAtG(y — 1) + o(At)^ 

G(l) = —hAt + G(l) — G(l) (A + r)At + AAt(b — k — aD) + G(D)AAt + o(At)^ . 
A 2 

With p := ' — and omitting the term o(At) from the above equation, we get 


G(y) - — ^by + pb + pG(y — 1) , 


(22.4) 


G(l) =-Jh + ?b + />(-k-aD + G(D)) . 


(22.5) 


Equation (22.4) is a differential equation of order 1 with the boundary condition (22.5). 
The solution of this differential equation is obtained through successive substitution 


G(D) = ^b-^ + p(pb- ^ .-- U + p(pb- ^ *) + 

+ ... + p (/?b — • 2 + pG (1))...)) 


ih(D - 2 


1 - p 


T-r 



D b 1 

u bp V ^ i D-l 
-~T 1 - p 

i=0 
D-l 

v / t* i \ i . D 1 /1 


(k + aD) 


: p ~ d I 1 ( D_i )^ 1 + P 

1 - P i=0 


(k + aD) . (22.6) 
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To interpret this formula, let 


1 

X 


r 

X 



1 


1 + 



: Average interval between two demands 
: Applicable interest rate for the interval j 
: Discount factor for the time interval i 

: Discount factor for one cycle 
: Present value of total profits 



D—1 

x 

i =o 


: Present value of all cycle costs 


: Average inventory cost of a cycle 
(discounted within the cycle) 


Assume we are in decision period zero when there is no inventory in stock. The 
equation 

G(0) = -k — aD + G(D) 
applies. 

We substitute G(D) from (22.6) and obtain 

-k - aD - ^ y (D - i)/ , 

G(0)=- X -^ - + Pr^~o • ( 22 - ? ) 

1 - p u 1 p 

The numerator Z of the first fraction on the right hand side of (22.7) represents the 
cost per cycle. Z/(l - fP) = Z(1 + /P is the present value of all cycle 
costs. The term pb/(l - p) is the present value of total profits. (Note: No demand is 
lost). It is independent of D. 
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Only the costs are dependent on D. These indicate that one could have easily 
formulated the problem as a cost minimization problem instead of a profit 
maximization problem. We now want to see from (22.7) exactly how the cost 
minimization problem proceeds. 


D—1 

The summation V (D - i)p l can be converted to 

i =0 


D-l 

D-l D-2 

0 

l (D-i )p i 

= Ip }+ 1 

p i + • • • + Ip 

i=0 

O 

II 

•'— 5 

o 

II 

j=o 


1 D 1 D-l 

_ 1 - P + 1 ~ P t .. 

" 1 - p 1 -p ••• 


; -.p 
i - p 




From this, (20.7) becomes 




b k + aD h 


D 


P 1 - / A (1 - p){\ - p u ) 


nr* 1 ? 


1 


(1 ~P) 


2 • 


const. 


const? 


( 22 . 8 ) 


The last term in this equation is a constant which is exactly the present value of the 
profit /?b/(I — p) such that (22.8) takes the form 


G(0) = Constant - C, 

The term Cp which is dependent on D, contains all negative terms of the profit 
function G(0). It represents, therefore, all costs. 
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The resulting cost minimization problem , with k + aD =: K, is 

C = Uil - A + . h -g D Min , 

1 A(1 -/>)(! -p D ) D 


or after simplification 


C = 


KA(1 - p) + h^D 
1 - o u 


-* Min 
D 


(22.9) 


C is convex. Hence, the minimum is determined by dC/dD = 0. We substitute 


p := e 


and obtain 


dC ! 


iO « [aA(l — p) + h/>] (1 — p ) — rp [(k + aD)A(l — p) + fyD] = 0 


aA(l - p) [1 — p* — rD^] + hp[l — p^ - rD/j^J = rp®kA(l — p) 
[aA(l — p) + hp] [1 — p^ — rD^] = r/?®kA(l — p) 


p — 1 — rD = 


rkA 
aA + h ■ 


- P 


e rD - 1 - rD = 


aA + h 


e r - 1 


( 22 . 10 ) 


We expand e r ^ into a Taylor Series 


r 2 D 2 r 3 D 3 
T" IT ••• 


rkA 


aA + h 


e r - 1 
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For r<<l, e r — l«r and one obtains the estimate 

2 n 2 2 , , 

r D ^ r kA 

~T~ ~ aXr T1 



( 22 . 11 ) 


This result shows that the interest-created effect can be interpreted as an increase in 
the inventory holding cost rate from h to aAr + h. The higher the interest rate is, the 
smaller is the optimal lot size. 


In limits, as p -* 1 



( 22 . 12 ) 


The result in the undiscounted case may also be directly derived from (22.8). From 
(21.3) we know that 

C/ T n. = 1 im (1 — p )G . 

( T ^ i P 


Hence 


k + aD + 


lim (1 — p)G (0) = lim (1 

/Hi p p-> 1 




D-l 

I I O-Op 1 
—] 


1 - p 


(l-p)G (0) 


lim (1 -p)G (0) 

/Hi P 


D-l 


k + aD + £ I ( D - 


= /)b- 


I . 


i=0 


7-[Y-T 

1 + P + ft + ... + /? 


b 4-a4.M 


D 


7 


= Constant — C 


1,0 = 1 ’ 
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and we obtain, as in the deterministic model, the equivalent cost minimization 
problem 


Min 

D 




whereby (22.12) also follows. 


523 POISSON DEMAND, NO DISCOUNTING 

In the previous sections, we discussed the case of no discounting by the " backdoor” 
method of the limiting value p -> 1. Now we want to formulate the corresponding 
model with the help of the Principle of Optimality. 

The case of no discounting contains conceptual difficulties since some of the present 
values of revenues and costs become infinitely large. In this case the minimization of 
the growth rate of cost represents a suitable objective function. The resulting model is 
discussed in this section. 

Under the present assumption that no shortages are allowed, it is reasonable to ignore 
the sales profit. Since discounting is ignored, shifting the time period for realizing 
profits does not have any effect. The only important thing is the total profit. Since 
the total revenues from the inventory control policy are not influenced, we choose a 
cost approach. 

Let 

e : Planning horizon 

i e (y) : Loss function with inventory y and planning horizon © 

l(y) : 1 im L(y) , if it exists 

©->00 
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Since, in the undiscounted case, the costs are proportional to time t in the long run, 
1 (y) will increase asymptotically linear for very large ©. 



Figure 23.1: Asymptotically linear total cost 
Therefore, in the stationary case, 

C : Cost growth rate per unit time 
is a constant value. For © -»oo, 


l e (y) =CAt + l^ t (y) . 


(23.1) 
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From the recursive method (Note: the planning horizon is shortened with increasing 
calendar time) 


l 0 (y) = hyAt + (1 _ '*At)l e _ At (y) + ''Atl e _ At ( y -l) , y > 1 
l e (l) = hAt + ( 1 -AAt)l 0 _ At (l) + AAt[k + aD + l e _ At (D)] , 


(23.2) 


becomes 

CAt + le_ At (y) = hyAt + (1 - AAtJl^^y) + AAtl^^y - 1), y > 1 

CAt + le_ At (l) = hAt + (1 - AAt)l e _ At (l) + AAt[k + aD + ^(D)]. (23.3) 

It is again assumed here that there will be an order only at y = 0. We must show that 
this naive method is successful, i.e., whether reasonable results for 1 and D can be 
derived from it. 

We represent (23.3) in the form 

CAt + AAtl Q _ At (l) = hAt + AAt(k + aD) + AAtl Q _ At (D) 

CAt + AAtl Q _ At (2) = 2hAt + AAtl^O) 

(23.4) 

CAt + AAtl Q _ At (D) = DhAt + AAtl @ _ At (D - 1) . 

By summation of these equations, the first terms drop out. What remains is 

D 

DC = h ^ i + A(k + aD) , 
i = l 



82 


CHAPTER 2: THE WILSON MODEL WITH POISSON DEMAND 


hence 

c = hD(D^+ 1) + Ak + Aa . ( 23 . 5 ) 

This is the stationary cost rate (cost of cycle per unit time). It is necessary to 
minimize: 

^ + Aa „ Min . 

We already have the same objective function in the deterministic model. Hence, the 
WILSON formula also follows with Poisson demand without discounting 



(23.6) 


How does one interpret C in a stochastic sense? 


The different stocks y = 1 , 2 , ..., D are the possible states of the system. The 
probability jr , with which the system finds itself in state y, is the same for all states 

since demand rate A is constant 



y 




We now write (23.5) in the form 


D 


C = 


h 5! y U + p A(k + aD) 

y=! TT 


1) 2)3) 


(23.7) 


1) Inventory is y with probability 1/D 

2) Probability of finding the system in state y = 1 

3) y = 0 with rate A and the cost k + aD is incurred 
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so that (23.7) can be interpreted as the expected value of the costs of a cycle 


D 


c- 1 


y-l 


C 7T 

y y 


Cy : Average cost of a cycle per unit time in state y. 

The state probabilities (in the present case 7r = are dependent on the lot size D. 

The method, which determines D by the minimization of (23.7), is called the 
METHOD OF STATE PROBABILITIES (more in section §31). 


It is interesting to note that, in the present case, both methods lead to the same 
objective function but in different forms. 


§24 RECURRENT PROCESS 

Now let A = A(t) be dependent on the time which has elapsed since the last event. 
This situation, for example, can occur in a newsstand which is located at a bus stop. 
The customers are mainly the bus passengers. As a rule, the bus is not punctual. The 
interval between two arrivals is then stochastic. Under the above assumption, the 
probability of arrival is dependent on the time since the last arrival. The longer it 
takes for one to wait for the bus, the higher is the probability that it will arrive at any 
moment. 

The state space is now two-dimensional: the inventory y and the time t since the last 
demand. Let 



Cost function with starting inventory y and planning horizon 0, whereby 
time t has elapsed since the last demand. 



84 


CHAPTER 2: THE WILSON MODEL WITH POISSON DEMAND 


We assume that the cost function increases linearly with time for a fixed y for © -»<» 
and with a rate C. Then the method "total cost tomorrow = total cost today + C" 
which is analogous to (23.1) leads to 


l Q (y, t + At) = l & _ At (y,t) + CAt . 


(24.1) 


Corresponding to (23.2), the recursive method is 

l Q (y,t + At) = hyAt + [1 - A(t)At] l 0 _ At (y,t + At) + 

+ A(t)At MinJU., (y - 1,0), Min{k + ax + Iq*. (x,0)} , 

w-at t x>0 y>l 

since demand has now occurred 


l 0 (l,t + At) = hAt + [1 — A(t)At] l 0 _^ t (l,t + At) + 

+ A(t)At Min {k + ax + Iq^^Cx^O)} . 
x>0 


(24.2) 


(24.3) 


Since cost increases are stationary (© -> oo ) we need not consider the planning horizon e 
explicitly and, therefore, may drop the index © or © - At. 


The functional equations (24.2), (24.3) are discrete in y and continuous in t. The 
following computation shows that it can be transformed for At -> 0 such that the cost 
function 1 is dependent only on y at time period t = 0. The process is interesting only 
at the transition periods (these are the renewal points for the function A(t) where A(t) 
is reset to the starting value A(0)). Such a process is called a recurrent process . 



We again have the same assumption: Order quantity x(y) = \ 




D 


for y > 0 , 
for y = 0 . 
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Then from (24.2) and using (24.1): 


t + At) + l(v,t 


+ A(t)l(y,t + At) = hy - C + A(t)l(y — 1,0) 


and for At-* 0, we have the linear differential equation 


<?1 (v,t 


+ A(t)l(y,t) =hy-C + A(t)l(y —1,0) 


(24.4) 


Intermediate computation: we solve (24.2) by integration. Again (24.4): 

-1 + A(t)l = hy-C + A(t)l(y — 1,0). 

The integration becomes easy if the left hand side of the integral is a product 1 -f. 
To achieve this, we multiply (24.4) with f(t), the so-called integrating factor 


—f A(x)dx 


f(t) = 


The left hand side of the above equation becomes 


-If + lAf, 

and because of the special form of f, it is identical to the derivative of If 

> 

-If + lAf = -|f(lf) • 

Hence (24.4) becomes 

-|(lf)=[hy-C + A(t)l(y-l,0)]f, 

and only the right hand side remains to be integrated. One obtains by using partial 
integration 
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t t 

—j A(x)dx —f A(x)dx 

00 oo A 00 A 

/ f(t)dt = / 1-e 0 =tf(t) IJ+ JtA(t)e 0 dt . 

o o v- v -✓ o \- v - y 

1 ) 2 ) 

V- s/ - / 

3) =: a 

1 ) Probability that no event occurs until time t 

2) Probability that an event occurs in time t 

3) Expected value of the time interval until the next event 


The integration of the complete equation results in 


‘If I o = (hy — C)a + 1 (y 


—j A(x)dx 
00 J 

1,0) J A(t)e 0 


0 


-j A(x)dx 


• lf l”= (hy-C)a+ l(y — 1,0)(—l)e 0 


00 

0 


We let l(y,oo) -0 = 0 and obtain as a solution from (24.4) 
l(y,0) = (hy -C)a- l(y - 1,0) . 

With this, the time t is eliminated and we get the intermediate result 
l(y) = (hy-C)a- l(y - 1) . 

If one expands the recursion (24.5), one obtains 


1 (y) = a(hy-C) + 1 (y — 1 ) = 


(24.5) 


= a(hy — C) + (h(y — 1) — C)+[...+a(2h — C + 1(1))] ...]] , 
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and therefore 


y 

l(y) = o -I (hi-C) + 1(1) , 1 <y <D (24.6) 

i=2 

1(1) = a(h — C) + k + aD + 1(D) . 

(24.6) is a system of D equations in D +1 unknowns 1(1), 1(2), ..., 1(D), C. The lot size 
D is considered as given and later determined by minimization. From the chosen 
optimization criterion "minimize the stationary cost increase per unit time", it follows 
that the optimal D is dependent only on the relative values of 1. Hence, since one of 
the l(y) can be arbitrarily chosen, we set 


1(1) := a(h-C) 


and obtain 


i(y) = 


a 


y 

l 

i=l 


C) 


or 


l(y) - g hy . (y„+ 11 _ ayC 

1(1) = a(h - C) 


1 <y<D . 


(24.7) 


For a Poisson Demand 


o = 7 tAe -At dt = { 


o 


5 
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and (24.7) becomes 

i(y) = hy(^r - 1 l_^c -1 < y < D t ,24.8) 

l(l)=}(h-C). 

This specification, however, is not essential. 

l(y) is the value function of dynamic programming. By summation of the equation 
(24.6) one eliminates the l(y) and, in this case (compare (23.5)), obtains also the cost 
function 

c = hilWJl + Ak + Aa ^ (24.9) 

from which the optimal lot size results 



(24.10) 


§25 PROOF OF OPTIMALITY 

The above results were derived under the assumption that the optimal ordering rule of 
the form "order D, in case y=0" has a given fixed structure. Optimization was carried 
out only under this type of an ordering rule. Assumptions and results seem to be 
feasible. Still missing, however, is the proof which shows that the optimal ordering 
rule has such a given structure. 

An optimal value function which does not assume any specific ordering rule obeys the 
functional equation 

l(y) + CAt = hyAt + [1 — AAt] l(y) + AAtMin {l(y — 1) ,Min {k + ax + l(x)}}, y >1, 

x 

1(1) + CAt = hAt + [1 — A At] 1(1) + AAt Min {k + ax + l(x)} . 

x 


(25.1) 
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We now show that our results (24.8), (24.9), (24.10) satisfy these 

functional equations. We substitute 1,C and D from (24.8), (24.9) and (24.10) into 

(25.1). 


l(xl substituted: The minimization Min { } gives 

Min{k + ax + (convex!) 

d ! n h h C _ A 

E -0: a + jX + jj-j-0 


_C 1 A 
x 'K - 2 -a K • 

C substituted from (24.9): 


_ D + 1 Aa Ak 1 \ n 

~ 2 ITfin _ 2~K a 


X 


_ 1 /i) . 2kAs 

"2^ + kih • 


(25.2) 


D substituted from (24.10): 


1 D 2 

x “ 2^ + D~) = D 


Hence 


Min {k + ax + 1 (x)} = 0 
x 


and 


Min {l(y — 1) ,0} = l(y-l) , y > 1 , 


since 


i (y) [lili-iJl - c] , 


y > i • 
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Hence, the ordering rule " order D, if y = 0" satisfies the principle of optimality (25.1). 
It remains to be shown that it is the only solution to the functional equation (25.1). 


Let x = D' < 



be another order quantity. Substituting D’in (25.2) 


1 u 

D ' = 2^ D + Ep) ’ 


D' < D 


leads to the contradiction 



(25.3) 


The assumption x = D' > D also leads to a contradiction. 


Remark 1: 


* ?Ak 

D = is the only real—valued solution to (25.1). However, since y 

and D are limited to integral values, two neighboring order quantities 
and D 2 (D 2 = Dj + 1) are optimal. 


Remark 2: It can be shown that the computational steps 

1) Choose D' 

2) Compute l(y)|D', and C(D') from (24.8), (24.9) and (24.10) 

3) Compute x = x(D') from (25.2) 

lead to an improvement ’^(D') is better than D'", i.e., C(x(D')) < 

* * 

C(D') as long as D'/ D . The optimal solution D is finally obtained 
after a finite number of improvements (only requirement: h, k and a are 
all non-negative). 


This method of dynamic programming is called POLICY ITERATION. A very 
detailed explanation of these methods are found in BECKMANN (1968) and 
HOWARD (1965). 



CHAPTER 3: 

STOCHASTIC SINGLE PERIOD MODELS 

§26 THE NEWSBOY PROBLEM 
Model with Proportional Shortage Cost 

The inventory models we have discussed to this point have been characterized by 
continuous stock monitoring. An order may be placed at any point in time. In 
contrast to these are the periodic models. Stock inspection and/or orders are possible 
only at discrete points in time, i.e., at the beginning of a period. If nothing is 
explicitly specified, all periods are taken to be of the same length. 

The simplest periodic model is the single period model. The decision problem reduces 
to only one period. Such inventory problems occur if the products cannot be sold after 
the period. Examples of these are fashion articles, travel offers, ticket sales for large 
presentations and daily newspapers. We formulate the last case as the basic model of 
the newsboy problem (or, in more recent terminology, the newspaper vending machine 
problem). 

Early in the morning, the newsboy buys a stack of newspapers and tries to sell these 
during the course of the day. He can only return the unsold papers at a loss. If he 
carries only a small quantity of newspapers, he misses out on a profit. Demand is 
uncertain but its distribution is known. His decision problem: "How many newspapers 
do I buy to maximize my profit expectations?" 

Let 

x : Stock of newspapers which the newsboy carries early in the morning 

p^: Probability that u copies will be sold 

P(u) : Probability that the demand is actually smaller than u 
f: Expected value of u 

h : Loss per unsold copy because of surplus 

g: Loss per unsold copy because of shortage (loss of profit and customer 

dissatisfaction) 
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In a typical decision situation, g > > h. / 

The decision variable is the initial stock x. The problem can be simplified under the 
non-limiting assumption that all possible shortages occur only at the end of a period. 
It is, therefore, enough to consider the situation at the end of the period. The time 
distribution of profit during the period can be ignored. The value x is to be selected 
such that the expected utility of the situtation at the end of the period is maximized: 

Max E{Utility at the end of the period} . 
x 

The utility function has the following form: 


Utility 



Figure 26.1: Utility function of the newsboy at the end of a period 
The objective function is then 

X oo 

Max {—h J* (x-u)p u -g l (u-x)p u } 
x u=o u=x+l 
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Min{h J (x-u)p u + g J(u-x)p u } 


(26.1) 


11=0 


u=x+l 


Since it is not convenient in practice to work with infinite summations, it is, therefore, 
useful to rewrite the objective function. We replace the summation by integration 


Min { h f (x — u)dP(u) + g f (u — x)dP(u)} = 
x o x 

x 

Min { (h + g) / (x - u)p du + g(fi - x)} . 

X 0 


Using partial integration 


(h + g) J (x-u)p du = (h + g)(x - u)P(u) |*+ (h + g) J P(u)du 
0 s___' 0 


= 0, since P(u) = 0 


one obtains the objective funciton 


Min{ (h + g) J P(u)du + g(/i - x)} 

X 0 


(26.2) 


d2 x 

It is convex since JP(u)du = p > 0. 

dx z o x 


From ^ ^ 0, it follows that 


(h + g)P(x) — g = 0 


* = rl <rb) 


(26.3) 
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The solution is easy to determine. One need not know the distribution function P. 
The information about P near the point is enough. The simplest manner of 

determining x is the following: 


Since 


P(x) = 


K+g 


then 1 -P(x) = j-J-g • 

Let g = lOh. Then 1 - P(x) = yy = 9% Hence, x must be chosen such that one 
carries fewer newspapers 9% of the time. 

We now ask: 

How large must the demand be so that it pays off to have a supply with initial 
inventory x > 0 ? An extreme solution x = 0 occurs if g/(h+g) reaches the critical 
value P(0). The business is profitable only if 


irf-g >p «»- 

In the above example, if demand does not occur at least 91% of the time, then one 
should give up the business. 


Model with non-proportional shortage cost 

To simplify the computation, the product to be stored in inventory is treated as a 
continuous variable (e.g., oil). The first applications of operations research and 
statistics were actually related to the provisioning of ships; for example, with fuel for a 
long journey. In these cases, it does not make too much sense to evaluate shortages 
with proportional costs. If three or five units of an important part are missing while 
on the high seas, then both cases are equally bad. Hence, we use the following 
approach: 
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X oo 

Min { h f (x — u)dP(u) + G J dP(n)} . 

X 0 X 


(26.4) 


G : Constant arbitrary cost when a shortage occurs 

The optimal lot size x is determined from the condition ^ ± 0, i.e. 


hP(x) -Gp x = 0 . 


Let the demand, for example, be exponentially distributed with the expected value 
1/A: P(x) = 1 -e~ Ax . Then 

h(l -e~ Ax ) -GAe _Ax = 0 

h= (h + GA)e _Ax 

x = j ' h A) • ( 26 ‘ 5 ) 

By a suitable choice of unit, h > 1 is always reached. Hence, (26.5) states that the 
stock quantity must always be greater than the expected consumption. 


527 EVALUATION OF P(x) = ^ | 

One of the most important demand distributions which occurs in practice is the 
Poisson distribution (see §19). If one considers the occurrence of demand during longer 
time periods, then the Poisson distribution is transformed into a normal distribution 
with the density 
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H : Expected value 

2 

a : Variance 


The normal distribution can be approximated by the 


LOGISTIC: P(x) =-; ■ s 1.6 . 

1 + e - ™ 


P (x,/i,<r) = 


1 


1 + e 



(27.1) 


The value m a 1.6 is achieved as follows: The density of the standard normal 
distribution at x = 0 is 1 The density of the standard logistic at x = 0 is 


d 1 

3x 1 , ^-mx 
1 + e 


x=0 


_me 

(1 + e 


-mx 


-mx 


2 



m 



Since both densities should be equally large, it follows that 


m =-« 1.6 . 


For the newsboy problem, the condition for the optimal lot size when applying this 
approximation is 


1 


1 + e 


m(x— p) 
a 


g 


1 

1 + 


5 

g 


and, hence 


e 







(27.2) 
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The optimal lot size x is a linear function of /i and a and an increasing function of g/h. 
One can see that 


g{ = }h 4 x { = } n . 


(27.3) 


We now investigate the costs. Let 

l(x) : Expected value of the costs for the single period model with optimal lot 

size x. 

For a model with proportional shortage costs, the expected costs according to (26.2) 
are 


l(x) = (h + g) \ P(u)du + g (fi-x) , 

0 


(27.4) 


and in the special case of logistically distributed demand 


l(x) - (h + g) f 


1 


-o - J(y -n) 

1 + e 9 


<iy + g(M-x) = 


m 


m a 

x - e 

= ( h + s)il — 


(y-p) 


0 *(y-p) 

1 + e a 


<iy+ g(//-x) = 


m 


lu (x //) 

= ( h + g) i inf 1 + e<r ^ ] +g(/^-x) • 


When applying the logistic, one assumes that a negative demand can be ignored. 


If the expression (27.2) is substituted for the optimal x, one obtains 



98 


CHAPTER 3: STOCHASTIC SINGLE PERIOD MODELS 


In P 

l(x) = (h + g) jln[l + e J-Si ln | = 

= i [( h + s) - g ln S = 

- 5 t" ^ 


and, finally, 

l(x) = (h + g) - [— t-——— lnr——-S—lm-S—1 . 

v ' v m L h + g h + g h + g h + g J 

We now know that the ENTROPY of the probability DISTRIBUTION is 

e(p 1 ,P 2 , • • • ,P n ) = -}» Pi I * 1 Pj , 

?i - 0, ^ Pi = is largest for a uniform distribution = ... - p n . 

The cost function l(x), therefore, reaches its maximum value with fixed a if 

h = g 

h + g h + g 

i.e., for h = g. In general, one can say: 


The expected value of the cost l(x) increases, 
if h -» g for a fixed h + g. 


Furthermore, for g > = h 

Ik > 0 and lg > 0 • 


(27.5) 


(27.6) 
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We now show ^ > 0 by differentiating (27.5). 



> 0 


Since the function l(x) remains unchanged when exchanging h and g, it also follows 
that 



Here is an example. Let a - 1 and 

a) h = g = 1; 

b) h = 0.1; g = 10 

In both cases, the geometric mean of h and g are equal to one, but 

a) l(x) = 0.77; 

b) l(x) = 0.317 

h / g means that there is a favorable and an unfavorable stock supply for the single 
period model. The result (27.6) in the above investigation asserts: the more h and g 
differ, the greater is the cost reduction by applying an optimal ordering rule. This is 
valid for any demand distribution. 
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§28 TEMPORAL STRUCTURE OF THE NEWSBOY PROBLEM 


itimal Period Length 


Instead of the newsboy we now consider an ice cream vendor in a football stadium. He 
may only sell ice cream inside the stadium. He sells until his stock have been sold out 
or until the end of the game. It is assumed that he does not replenish his stock of ice 
cream. Since he can freely choose when to begin selling, he (in a way) freely chooses 
the length of the selling period. In his case, is there an optimal period length in this 
single period problem? 

As before, we had assumed a Poisson demand which we then approximated using the 
logistic function. With a Poisson process, the expected value and variance are 
proportional to time (compare §19) 

fij-<Tj = X T , i. e. (Trji = • 

The holding and shortage costs are also proportional to T 
h T = hT; g T = gT. 

With this, the time-dependent expression for the single period cost with logistically 
distributed demand (27.5) is given as 


l T (x) = ( h + S)T exp [- in rfr - 


+ g n + g- 


The expected total costs per time are 


C = 


400 


T= ( h + g) ST JT exp [- in t-4 



- +T' 

+ g J T 
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The optimal period length with logistically distributed demand is 



A rough simplification is, however, assumed here: The random result "a demand 
occurs" is set at exactly the end of the period. 

It would be more precise to consider the time of occurrence of the demand within the 
period. 

A More Accurate Formulation 

We again assume a Poisson demand. In the model with a fixed period length (exactly 
one time unit) 

x 

l(x) = (h + g) £ P(u) +g(/i-x) . 

u=0 

Now P(u) = P t (u) and fi = /z t , and the cost function is 

T x 

l T (x) = J{(h + g) £ p ( u )+g(// -x)} d t . (28.1) 

0 u=0 

For a Poisson demand with rate A 



j=o 
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With these expressions, the objective function (28.1) becomes 


x u 


l x (x) = J • (h + g) y y At dt + g I At dt-g X T 

1 U U j * f\ 


U=0 J=0 


(28.2) 


Intermediate computation: 


| e -->* dt = - \ ^ e - AT . J e- At dt 

0 J- A J• 0 IJ i >- 


u v' = ... (partial integration continued) . 


r=o 


- J [* ~ * 


With the help of this intermediate computation, (28.2) becomes 


x u 


l,p( X ) - 


h + 


i 1 t 1 — p x( j)] +g^2“-gxT 

u=o j=0 


(28.3) 


We approximate the Poisson distribution for large AT by the logistic 


P T (n) = 


1 + e 


r T 0> - ^ 


(28.4) 
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From (28.3), we derive 


x u 


1 t( x ) = h 1 5 I J 

u=o r=o 


1 - 


1 


m 


1 + e 


ui / 

— (r 

v 




dr du + gA i— g x T . 

(28.5) 


We substitute 


u 

! 

—oo 


- At) 


1 - 


1 + e 


?(r - At) 


dr = u — - In 
m 


1 + e 


"(u-At) 


in (28.5) and obtain 


1 t( x ) = ^ L 1 _£ / ' 
1 * u=o 


u — — In 
m 


m 


1 + e 




du + g\ y -g x T. 


This cost function is now minimized with respect to x (for a fixed T). 

m L 0 s ives 


h + g 


x — - In 

m 


in 


1 + e 


^ x -A X ) 


- gT = 0 ; 


h + g m 


In 


1 + e 


J( x -^ T ) 


x — - In I 
m 


| 

e + 1 


m 


=(* - ^ T ) 


--In 

m 


*( x -/* T )i 


1 + e 


+ At . 


(28.6) 
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Since = AT and a = fIT , therefore 


AT FT-? = ? ln 
n + g m 


--=(x-AT) 
1 + e ^ 


The solution to x gives the optimal lot size 



(28.7) 


A feasibility consideration shows: 

as g/h increases, the optimal lot size x also increases. 

One can also approximate the Poisson distribution by the normal distribution in the 
objective function (28.2) 

j=0 


with n = At; <7 = |TT. The optimal lot size x, however, can no longer be explicitly 
specified. 


529 EXACT FORMULATION 

We now want to derive the exact formulation for a Poisson demand. As before, let 
u : Demand within T 

p u (T): Probability that u pieces are demanded in [o,T] 
x: Starting inventory 
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Two cases occur: u < x and u > x 


Stock Stock 




Figure 29.1: Inventory positions in two cases 


The holding and shortage costs during period T are formulated as 


f u« 


h • T • 


x + x — u 


7 


hx x T g(u—x) u - x 

r * — 1 + “-*75 —* - 

u 2 u 


for u < x , 


T , for u > x . 


(29.1) 


The expected value of this single period cost with starting inventory x is 


00 

tw- RW^e-« 


u=o 


(29.2) 
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x 

1 (x) = hT £ (x -p u (T) + Ip x* 
u-o 


V PuW 
Z u 

u=x+l 



u=x+l 


(29.3) 


whereby Al(x) is the first difference l(x + l) - l(x). 

Al(x) is now to be computed. This situation arises in many inventory problems in 
which the stock level is a discrete variable. If l(x) is not defined differently in the 
various intervals and if the summation limits do not depend on x, then the difference 
operator A can be brought into the summation sign. However, this assumption does 
not hold because of (29.1). We will show that one can, nevertheless, proceed in this 
manner (compare SASIENI et.al.). 

The holding and shortage cost function, f, is piecewise composed of two partial 
functions f^ and f 2 for all defined values of x 

f M (x) , for u < X , 

^2 U W ’ f° r u > x • 


f (x) = 
u v ' 


Let 



then (29.2) can be written as 



I V x ) • 

11=0 
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It now applies that for any monotonicly increasing summation limits a(x) and b(x) 

M x+ i) „ 

l(x+l) = J f u ( x+1 ) 

u=a(x+l) 

b( x ) „ b(x+l) ^ a(x+l)-l ^ 

= l V x+1 ) + l V x+1 ) ~ l V x+1 ) ’ 

a(x) b(x)+l a(x) 


and hence 


b ( x ) „ 

b( x+ l) „ 

Al(x) = 

t 

l Af u W + 


a 

(X) 

b(x)+l 

Since 

1 

«* 


-y- 

II 

'7? 

f l,uW ’ 

for u < x , 


f 2,u( x ) ’ 

^ 7 

for u > x , 

then 



b ( x ) „ 

00 

l( x ) = / 

> f l,uW 

+ I f 2 

11 - 

=0 

u=b(x)+l 


a(x+l)—1 

i 

a(x) 




f u( x+1 ) 


(29.4) 


whereby b(x) = x. 

To determine A1 we now turn to (29.4) for both sums on the right hand side and 
obtain 


b(x) ^ oo b(x+l) 

A1 « = I A *l, u to + l ^2,«W + l [?!, u ( x+ 0-? 2 , u^ x+1 )] 

11=0 u=b(x)+l b(x)+l 
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Since b(x) = x, the last summation is limited to 
f l,x+l ( x+1 )- f 2,x + l( x+1 ) • 

It has the value zero, since one knows from (29.1) that the equation f- (x) = f 9 (x) 

1,U Z,U 

applies for u = x + 1. One can therefore bring the difference operator into the 
summation sign. 

The optimality condition for this discrete problem is 


Al(x—1) < 0 < Al(x) . 
This leads to 


Al(x) = (h + g)T 




u=o 


u=x+l 


g T • 


The minimization of the expected single period cost means: 
Choose the smallest integer x which satisfies the condition 


> rh 


(29.5) 


(29.6) 


whereby 


* 1 v P„( T ) 

*W - 1 P U (T> * ( x *7> l -V 

u=o u-x+1 


and, especially for a Poisson demand, 
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11=0 u=x+l 


(29.7) 


If one wants to determine the value of the objective function l(x) aside from the 
optimal lot size, then one starts at best with k=0 and computes the series according to 
the value M(k) for k = 1, 2, ... until the condition (29.6) is satisfied for the first time. 
The corresponding k is the optimal lot size x. One uses the values M(k) for the 
computation of l(x). 

From 


Al(x) = (h + g)TM(x) — gT . 


one easily obtains l(x): 


x-1 


l(x) = 1(0) + l Al(k) . 

k=o 


Since 


00 

1(0) = ^- ^ U P U (T) =^-E{u} 


(Poisson) 



u=o 


then 



(29.8) 


This is the expected value of the inventory and shortage costs for a period of length T. 
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Optimal Period Length 

Until now the period length T was fixed. We now approximately compute the 
minimum average cost of a period per unit time. 


Min c(T) = Min 
T T 



(29.9) 


Simplest path: c(T) is a convex function with 1 im c(T) = oo. 

T->00 

c(T) 

A 



1 - 1 -!- 1 - 

Tl T 2 T* T 3 

Figure 29.2: Optimal period length 


► T 


We compute for three different values T\, T 2 , T, which should lie near T , the 
average costs c(T^), c(T 2 ), c(T,) and approximate c(T) by the function of type 
f(T) = £ + 0 + 7 • T. 


This is clearly determined by the three points (T^, c(T^)), (T 2 , c(T 2 )), (T^, c(T,)). 
The minimum lies at 



(29.10) 
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§30 OVERBOOKING 

A standard example for overbooking is hotel reservations: A conference is being held 
in a large hotel during the peak tourist season. The visitors register their participation 
with the organizer. The organizer, in turn, negotiates a price discount with the hotel 
manager and books overnight stays for the registered participants. 

The hotel manager knows from experience that for large conferences a number of 
registered participants fail to arrive without cancelling their reservations (so-called 
"no shows"). It may be profitable for him to keep fewer rooms reserved than were 
booked. Let 

b : Booked rooms (each participant books a single room) 

x : Vacant rooms (capacity) 

h : Reservation cost of a room for a "no show". A participant who is not coming 

pays only the discounted price. Had one known that he would not come, the 
room would have been rented at the normal price, h is, at the same time, the 
day’s discount (normal price minus the discounted price), 
g : Shortage cost. The participant whose room is overbooked arrives. The hotel 

must assume the cost of the external accomodation of the guest at a higher 
price category. 

u : Number of participants who have reserved rooms and who actually arrive 

q : Probability of a "no show" 

Given b bookings the probability that u guests actually arrive is 

P u;b = (5)( 1 -<l)V _U (30-1) 

and the accumulated probability P(u;b) = probability, demand < u with b bookings 

P(u;b)= l (y)(l - q ) y q 1_y • 

y-° 


(30.2) 
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This optimization problem is a newsboy—type problem. The decision variable x is the 
number of rooms reserved for the conference (inventory). The optimal inventory 
according to (26.3) is 


x = P 1 (—S—) 
r ^ + g' 


(30.3) 


The binomial distribution defined above has the expected value and variance 

I* = b(l -q) ; <P“ - bq(l -q) . 

If b is large, one approximates this distribution by the normal distribution (so-called 
normal approximation). Then from (30.3) 


x - 


N is the distribution function of the standardized normal distribution. If one 
approximates the normal distribution by the logistic, one obtains from the above 
relation 


1 + e" 


[x - b (1 q) ] 1 + - 


[x — b (1 q) ] h 


We solve this equation for x and obtain the following equation for the optimal lot size 
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Again 


also applies, i.e., if the shortage cost is larger, inventory is larger than the expected 
utilization. It is the other way around if the holding cost is larger than the shortage 
cost. 



CHAPTER 4: 
STOCHASTIC MODELS 
WITH CONTINUOUS REVIEW 


§31 METHOD OF STATE PROBABILITIES 


We already encountered in §23 an inventory model with continuous review. A Poisson 
demand was assumed. It was shown that under this special assumption the optimal 
order quantity D is the same as that in the deterministic model with a constant 
demand rate. D was obtained using the Wilson formula. The interpretation of the 
objective function C in the stochastic sense led to the method of state probabilities. 

In this chapter, we want to generalize the model with continuous review with respect 
to the demand process and the delivery schedule . In this case, we again use, among 
others, the model of state probabilities. The basic idea of this method can be 
summarized in three steps. 


Step 1: Determination of the structure of the optimal ordering rule in parametric 

form (for example, "order D, if y = 0"; D is the parameter with a still 
unknown optimal value). 


Step 2: 


Derivation of the stationary state probabilities. Let 7iy(t) be the 
probability that the system is in state y at time t. This means 



lim ,0) 

t-*« y 



is the probability of the state y when using the fixed lot size D. 

In general, the stationary distribution depends on the initial distribution 7 r^\o) 
and on the parameters D of the ordering rule. It can be shown that the distribution 
7T' exists for the present inventory model and the ordering rule "order D in case y = 
0” is independent of the starting distribution 
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Step 3: Minimization of the expected costs per unit time, that is the stationary 

expected value 


Y C -» Min . (31.1) 

1 y y d 

y 



Cost per unit time in state y 


In the case of a Poisson demand, inventory is uniformly distributed at any randomly 
selected time (compare §23). 

We now generalize the demand process. We assume that purchases come one after the 
other (rather than in batches). Let 

p u : Probability that a customer buys u units, u = 0,1,2,... 


We also assume a Poisson process for customer arrivals . Hence, the demand process is 
described by a compound Poisson process (compare §19). 

A time-dependent consideration, i.e., a cost recursion t -» t + At, is complicated. 
Since the objective function depends only on the expected value, one can assume that 
the customer arrivals are 1/A time units apart. This is the mean value of an 
intermediate arrival interval. It simplifies the stochastic process into a Markov chain 
whereby at each event a transition from one inventory state to another (in the same 
state if u = 0) takes place. 
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u = 0 


Figure 31.1: State transition diagram 

If demand is greater than the stock then the new state is y = 0 and the unsatisfied 
demand is lost. 


For the ordering rule we again define the known structure as 


Order Quantity z(y) 


’ 0, for y > 0 , 
D, for y = 0 . 


In this Markov chain model, the transitions take place after each 1/A time unit. For 
the state y = 0, the following conditions apply: the system persists 1/A time units in 
this state and the order is only placed at the end of the period (for immediate 
delivery!). Thus no inventory costs occur during this period. 
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The equations which determine the stationary state probabilities are: 

(31.2) 


(31.3) 


(31.4) 

There are D+2 equations with D + l unknowns; however, one equation is linearly 
dependent, since (31.2) and (31.3) only determine the values Tty y = 0, 1, 2, ...D 

relative to each other. Therefore one still needs the normalizing equation (31.4). 

Geometric Distribution of Demand 

Let the demand u of a customer be geometrically distributed 

p u =(i-p)p u ; o<p<i , u = 0 , 1 , 2 ,... . 

Then 

D 00 

^ D = (1-P) l l P U 

i=o u=i 

D 00 

= (!-P) I T iP X I P U 

i=o u=o 
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x v = 0 -P) l 'iP 1 Y 


l=y 


= (1 — P)x y + P(1 - P) J ^P 

i=y+l 


i-(y+l) 


4 r - x , 1 

y y+i 


r o = ( 1_p ) l V 1 


1=0 


= Ti 


As a whole: 


= i 


y+i 


y = 0,1,... ,D—2 


=> * 0 = 0 ~ p) x ] 


X = 

y 


t d , for y = D, 

(1 —p)x D , for 0 < y < D-l 


With the help of the normalization condition (31.4), 71 ^ can now be calculated 


i 'i=> 


14 ’D = TT 


1=0 


The stationary state probabilities, therefore, are 


j - 

y 



(31.5) 
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The objective function dependent on D can now be formulated, 
y are 


The costs 



in state 


C 


y 


' hy , for 1 < y < D , 
k + aD , for y = 0 . 


(31.6) 


The expected cost per unit time (31.1) in this case is 


D-l 


hDxp + h ^ yr + (k + aD)T Q 


y=i 


Min . 
D 


If one substitutes for the state probabilities the values found in (31.5) the following 
objective function is obtained 


k + aD + h 
1 - P 



+ 


hD 

1 + D(l-P) " 


Min 

D 


(31.7) 


The first fraction has a similarity to the objective function (2.1) of the Wilson model. 

* 

From (31.7) no expldt formula can be derived for the optimal lot size D . The 
objective function (31.7), however, can be easily evaluated. The recommended method 
is to start the evaluation with the discrete Wilson lot size and to continue the 
computation within a discrete neighborhood until one has found the minimal value. 


§32 POISSON DEMAND, EXPONENTIAL DELIVERY TIME 
Now we consider models with delivery times. 

Let the demand be Poisson distributed and the delivery period exponentially 
distributed. 

A : Demand rate 
/i: Delivery rate 
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Since the delivery period is greater than 0, it will never be optimal to order when 
y = 0. An order is placed when y = s > 0. 

Let the lot size be D. Then the value 

S = s + D 

is the maximum inventory from a given period up to the time when the stock first 
reaches the point s. 

Since the stock is continuously reviewed, an order is placed exactly at y = s. 
Inventory can be allowed to drop until it reaches this value again. However, another 
order is not allowed until the last order has been used up. 

The ordering rule is a type of the so-called 

(s,D)-policy 

also known as the Two—Bin Policy . 

It was practiced in the past by herring sellers. They had an open barrel and, in 
addition, a closed barrel in reserve. As soon as the open barrel is emptied, the second 
barrel is opened and a new barrel is immediately ordered. 

For models with delivery time, 

D > s (32.1) 

is more reasonable. If D < s and the stock is allowed to drop to y = 0 before the 
arrival of the delivery, then the new stock after the arrival of the delivery would be y 
= D < s and one must immediately order again. 
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Figure 32.1: Operational characteristic of inventory for a (s,D)-policy. 

B = order; L = delivery; L — B = delivery time 

Sales are lost as a result of stock deficits (Lost Sales Case). 

How large are the state probabilities in this model? We consider the following cases: 
Case 1: y = 0 

The state y = 0 takes a special position as a boundary point of the state space. The 
state transition diagram related to a small time period At looks as follows: 


juAt 



1 — ^ At 


Figure 32.2 
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The arrows are the transitions probabilities. The probability of remaining in state y=0 
or arriving in it after a small time span At is 


x (t + At) = [1 — //At] x (t) + AAtXj(t) 


(32.2) 


As At -» 0, it becomes 


d^ 0 (t) 

_ d _ - = -/JT Q (t) + AXj(t) . 


In the stationary case, lim tt (t) = 0, i.e. 

t">00 


T l = f T o ' 


(32.3) 


(32.4) 


Case 2: 1 < y < s. 

The state transition diagram has the form 



The probability of remaining in a state is 1 — AAt - /iAt, i.e., neither the existing 
order has arrived nor a demand has occurred. This applies also for the state y = s. At 
the latest, there was an order made at the start of the interval At. We have 

x (t + At) = [1 — AAt — /xAt] t (t) + AAtr (t) . 

y y y 


(32.5) 
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In the stationary case 



1 < y < s . 


(32.6) 


Case 3: s < y < D 

These states can also be reached only from higher stock levels, as the state-probability 
diagram shows 


s 

Mt 

y 

Mt 

y+l 


c_i 


1 - Mt 1 - Mt 

Figure 32.4 

The recursive equation of the state probabilites is 
x y (t + At) = [1 — AAt] x y (t) +AAtx y+1 (t) . 

The stationary solution is 

J , 1 = T . 

y+l y 



(32.7) 


(32.8) 


Case 4: D < y < S 

These states occur as a result of a demand as well as a delivery 


fA t 



Figure 32.5 
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Then 


x y (t + At) = [1 — AAt] x y (t) +AAtx y+1 (t) + Mt* y _ 0 (t) 


(32.9) 


from which follows 


r y = Vl + X 'y-D 


K-I 


(32.10) 


Case 5: y = S 

The upper boundary point of the state space can only be achieved by an arrival of 
goods. 


1 



Figure 32.6 

We have 

Xg(t + At) = [1 — AAt]xg(t) + /iAt Xg_j)(T) (32.11) 


and 



(32.12) 
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In summary, 


i—*■ 

ii 

o 

y = o 


0 < y < s 

j . i = t ; 

y+i y 

s < y < D 

I = X w + ^ x n ; 

y y+1 X y-D ’ 

D < y < S 

T S = f J S-D ; 

y = s. 


We set 


P = 


a = 


X 

P 

1 + P 
P 


and represent the state probabilities independent of it . 



7 r is then determined by the normalization condition ^ ir = 


1 


T o = 


1 


D — + 1 
P 


(32.13) 


(32.14) 
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In the next step, the average cost C per unit time in the stationary case will be 
calculated with the help of the state probabilities 

S 

C = V s + T s+l^ k + aD ] + h 1 yT y (32.15) 

y-i 

= T o jg + T 0 jr A + aD l + hT o /? ’ 


whereby 


/? = [D 2 + 2Ds - 2s - D] + a s [a(s + D) - s - 2D] + D. 


(32.16) 


Substituting 7r Q from (32.14) results in 


C — jj— + r d\ + 


(g “ a ) A - + ffh. 


(32.17) 


a 


D — + 1 
P 


We now attempt to determine the optimal values s*,D* . This is still possible in the 
border line case u > > A. 


Borderline Case: /i > > A 

p << 1 follows from n >> A and from it a >>1. The objective function (32.17) 
becomes 


kA , 


aA 


U 


i[D 2 + 2Ds-2s-D] +^- 


(ft 


s — 2D 




lira C ft = C = + aA + |(D + 2s + 1) . 


(32.18) 
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The equations 

(32.19) 

(32.20) 



give the necessary conditions for an optimum. 

We have a boundary extremum with regards to s. As expected, we obtain in the 
borderline case // > > A the results of the model without delivery time from §22. 


Reserved Storage Area 

In all other cases, one must determine the solutions to D and s either by numerical 
methods or by simplifying the model such that it can be solved by analytical methods. 
The source of the difficulties is the term /?. 

The average cost C in (32.15) depends on all inventory levels y = 0, 1, 2, ... S. The 
problem eventually becomes simpler if the inventory cost is measured at its maximum 
level: h(s + D). This is the case, for example, if one does not own the warehouse, but 
reserves storage area at an external warehouse. It must be big enough to cover the 
maximum stock level. The objective function is then 

C = r Q g>l + [k + aD] + h(s + D) . 

After a short intermediate calculation one obtains 


C = Aa + - a )^ + kAftS + h(s + D) . 

p + Da s 


(32.21) 


The term /} no longer appears here. With this, the minimization of C relative to s and 
D becomes simpler. However, we still cannot avoid numerical methods. 
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533 POISSON DEMAND, FIXED DELIVERY TIME r 

We consider an inventory model with continuous review, Poisson demand and a fixed 
delivery time r. For the formulation of the model we now use Bellman’s Principle of 
Optimality. Future costs are not discounted. 

If we observe the inventory level y in period t, we can influence the level with an 
immediate action at the earliest period starting from period t + r. Until then we have 
no influence on what has already occurred. We apply the cost l(y) which corresponds 
to an event occurring in period t + r. Previously placed orders may arrive between t 
and t + r. The inventory level y t + r is therefore dependent on y t , on the amount on 

order and on the decision made in period t. Hence we define in the following model the 
state y 


Stock on hand plus on order 


Unsatisfied demand is backlogged (BACKORDER CASE) 


Delivery period r 


l(y), cost which 
j starts here 


t + r 


Figure 33.1: Costs are incurred at time t + r for a delivery period r 


The inventory level y at time t + r is a random variable. 


The probability (Inventory = y — u at time t + r | Inventory = y at time t) 


= Probability (Demand = u in time r) 
= - e~^ r for Poisson Demand. 

vl» • 
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The expected inventory and shortage costs in period t + r are 

y oo 

f(y)=h J (y-u) + g ^ (u-y) ■^-e _AT . (33.1) 

11=0 u=y+l 

As was shown in §26 (compare (26.1), (26.2)), this expression may be rewritten as 


y 

f(y) = (h + g) J P u + gO*-y) , 

11=0 

where 



Note: The discrete distribution function is written as P =P(u<x) instead of the usual 

A 

convention P =P(u<x) so that the above expression (26.2) derived by integration may 

A, 

be used for discrete demand. 

To simplify the notation, we define the demand rate A as demand per unit time r. 
Hence, we do not need to use r explicitly in the notation. 


Principle of Optimality 
Let 

C : Average cost per unit time 

For a stationary cost increase, the cost 1 becomes the total cost 1 + CAt if one moves 
the present time period back by At. According to the optimality principle of Bellman 
we have the recursion 
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l(y) + CAt = f (y)At + [1 — AAt] l(y) + (33.2) 

+ AAt Min {k<5(x — y + l)+a(x — y + 1) + l(x)} , 
x>y—1 

for s < y < S. By cancelling l(y) from both sides and dividing by At one obtains 

Al(y) + C = f (y) + A Min {k£(x - y + 1) + a(x — y + 1) + l(x)} (33.3) 

x>y-l 

for s < y < S. 


Structure of the Optimal Policy 

We begin by considering a starting inventory y = S. This level drops in the course of 
time. Sooner or later, stocks must again be ordered. This happens for y = s. The 
order quantity is S — s = D. The structure of the ordering rule is, therefore, of the 
(s,D)-type. Following this, one can divide (33.3) into 

A1(S) + C = f(S) + Al(S-l) 

A1 (S — 1) + C = f (S — 1) + Al(S-2) 

• 

Al(s + 1) + C = f(s + 1) + A[k + aD+l(S)] . 

The sum of these individual equations gives 
S 

DC = ^ f(y) + Ak + AaD . (33.4) 

y=s+l 

In this case, the average cost C per unit time is a function of the structural parameters 
s and D. The optimization problem is 


INVENTORY CONTROL: MODELS AND METHODS 


131 


s+D 

C = jj J f(y) + -* 

y=s+l 


Min 

s,D 


(33.5) 


The constant Aa does not influence s and D. Thus we consider the average cost c, 
without proportional ordering costs. For simplicity we use the integral representation 
of the sum 


c = 


f i s+D 

• J J f(x)dx + 


Ak 

r 


Min 

s,D 


The conditions 


dc 


= 0 and 


dc 

W 


0 necessary for an optimum give 


(33.6) 


f (s + D) = f (s) 

s+D 

D-f(s + D) — / f(x)dx — Ak = 0. 

s 


Substituting the first equation into the second results in 

s+D 

(s + D)f(s + D) — sf(s) — J f(x)dx = Ak 

s 

x = s+D s+D 

xf l x = s - J f(x)dx = Ak . 


The partial integral /f dx = xf — Jxf 7 dx finally leads to the necessary optimal 
conditions 


(33.7) 

(33.8) 


s+D 

f xf'(x)dx = Ak 
s 

f(s + D) = f(s) . 


These two equations can be solved numerically. One may obtain an approximate 
solution in an analytical manner if one expands both of the above optimality conditions 
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in a Taylor series and solves the system of equations for s and D. One obtains explicit 
formulas for s and D. The same results, however, can be achieved with less effort if the 
objective function (33.6) is first approximated by a Taylor series and then the partial 
derivatives set to zero. It is 

f (x) = f (A) + (x-A)f'(A) + (x-A) 2 ^t^- + ... (33.9) 

Integrating term by term 

s+D 0 

f f(x)dx = Df(A) + f'(AA) ax + ... 
s 

= Df (A) + ^^-D[D + 2(s — A)] + 

+ D[D 2 + 3D(s — A) + 3(s — A) 2 ] + ... , 

and stopping after the second order term in x. We obtain the approximate objective 
function c 

c = jp + f(A) [D + 2(s — A)] + 

+ --^ A ) [D 2 + 3D(s - A) + 3(s - A) 2 ]. (33.10) 


The condition 



= 0 gives 


D + 2(s - A) =-2 


(33.11) 


The condition 


dc 

W 


= 0 results in 




(33.12) 
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If one substitutes the equation (33.11), which was solved for s — A, into the above 
equation, one obtains for D an expression independent of s 



3 


mr~ 


(33.13) 


Equation (33.11) gives 


s 



D 

2 


+ A 


(33.14) 


Note: The assumption of a Poisson distribution is implicit in the formulation of the 
Principle of Optimality (33.2). Since the Poisson distribution is a discrete distribution, 
one must use the first or the second difference quotient of f instead of its first or second 
derivative. The approximation by a continuous distribution is simpler. 


A pproximation by a Normal Distribution 

For large A, the Poisson distribution can be closely approximated by the normal 
distribution. Let 

N((x—A)/VX): Distribution function of the standardized normal distribution, 

when the Poisson distribution has an expected value of A. 


f'(x) = (h + g)N(^-—-) -g; 

1A 

f'W - V 


f"(A) 


h + K J_ . 
fJr 


Then 
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Since a - it follows from (33.15) 
D ~ a . 


(33.15) 

(33.16) 

(33.17) 
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Since f'(A) and f' '(A) are identical for the normal and logistic distributions, we again 
obtain for D the formula (33.15). 

We now substitute f(x) in the necessary optimality condition (33.8) and get 


(h + g) i In 


1 + e 


1 + e 


“(s+D-A) 

ImT 


= gD. 


(33.19) 


Since D is known from (33.15) one can compute the expression 


A = e 


m D 
a 2 


With a further simplification, let 


V = e 


m/ D 

^:(s + ^-A) 


(33.20) 


then equation (33.19) becomes 

1 + T 


and it follows that 


V = 


Z - 1 


A - 


After computing V, the value of s can be determined. We solve (33.20) for s and 
obtain 



(33.21) 
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with a = (J and m = 


(Tl T 


Detailed computations show that equations (33.16) and (33.21) give good approximate 

* 

values for the optimal value s . The approximation given by (33.15) for D, however. 

* 

results in most cases into an underestimation of the optimal value of D . 


It is possible, however, to correct for underestimating D. It can be done in the 
following way: 

The function f is expected to assume its minimum near s + y . However, the value A, 

which was expanded around f(x) in a Taylor series (33.9), may be far from the 
minimum point. In the hope of a better approximation, one can expand f(x) in a 
Taylor series about the point s + ^ instead of the point A. The calculation leads, 

however, to complicated expressions. 



Figure 33.1 








INVENTORY CONTROL: MODELS AND METHODS 


137 


But we can improve the value of D by a correction. After D and s have already been 

computed using (33.15) and (33.16) or (33.21), we can compute an improved D = 

D according to the formula 
new ® 



Cost Function 

The following equation applies to the cost function 


_ kA 1 r r/ x i 
c “ JT + D f f (x)dx - 
s 

kA 1 (■ r ( \ i S+D r r / / \ j -| 

= IT + n[ xf ( x ) Is “ J x i' (x)dxj = 



+ D)f(s + D) - sf(s) 1 

U 


s+D 

/ x f'(x)dx 
s 



(compare (33.7)) 


With (33.8), it becomes 


c - f(s) = f(s + D) 


(33.22) 


§34 POISSON DEMAND, STOCHASTIC DELIVERY TIME, SINGLE ORDER 

Under competition conditions, service reliability is an important factor. The supplier 
will try hard to meet his delivery schedules. In inventory control, the primary 
variability is in the demand. The opposite is true in a monopolistic situation or in 
places where goods are rationed; then the primary variability is in the delivery time 
rather than in demand. This is often observed in developing countries. 
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We now set up a model with stochastic delivery time. The stock is reviewed 
continuously. As long as there is still a standing order, no further orders should be 
placed. Let delivery time and demand be independent of each other. Both follow a 
Poisson process. This inventory model was already covered in §30. In that section, the 
method of state probabilities was applied. We attempted to derive a formula for s and 
D but were not successful. In this section, we apply the Principle of Optimality. Let 

/iAt: Probability that an existing order is delivered in period At 

AAt: Probability that a unit of good is demanded in period At 

t : Time since the last order 

t = 0: No order is placed. 

l(y.t): Value function in the stationary case 


Undiscounted Case 

We now formulate the Principle of Optimality for the undiscounted stationary case. 
Let t = 0: 


l(y,0) + CAt = hyAt + [1 — AAt] l(y,0) + 


+ AAt Min {Min {k + aD + 1 (y—1,At)} 

D 


order 


i(y-M)}, 


do not order 


whereby 

Al(y,0) + C = hy + AMin {Min { . } | 1 (y—1,0)} 

D 


(34.1) 


follows. 
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With a large initial inventory, an order is not worthwhile. The lower the initial 
inventory, the lower is the cost advantage of the decision " not to order". At a certain 
point y = s, it is more favorable to order. Since inventory is reviewed continuously, 
one immediately orders the quantity D at y = s. Considering this plausibility, the 
model with delivery time also gives a justification for the (s,D)-policy. 

Let t > 0: 

As long as there are still pending deliveries, one may not place a new order. This 
situation creates no room for decision-making. One need not apply the Principle of 
Optimality. The cost recursion is given by (see §23) 

l(y,t) + CAt = hyAt + [1 — AAt — //At] 1 (y ,t + At) + AAtl (y—1 ,t+At) + //Atl (y+D,0). 

(34.2) 

The boundary condition for y =0 is given in the lost sales case by 

1(0,t) + CAt = AAtG + [1 — AAt — //At] l(0,t+At) + AAtl(0,t+At) + //Atl(D,0). 

(34.3) 

Let 

G: Penalty cost if an order is not delivered thereby causing customer 

dissatisfaction; this is independent of time (dimension: cost) 

As At -> 0, from (34.2), the differential equation becomes 

[A + //] l(y,t) = hy + A1 (y—1 ,t) - C + // l(y+D,0), t > 0 (34.4) 
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with boundary condition 

- ft = -C + AG — ji[l(0,t) - 1(D,0)] . (34.5) 

Hence, the optimization problem may be described as a linear differential equation 
which can be solved by integration. 


Discounted Case 

r t 

We are interested in l(s,0). Let r be the interest rate and e the discount factor from 
time t to time zero (see §21). With 

q(r)dr. Density of the delivery time distribution 

one obtains 

T S 

l(s,0) = k + aD + 7 q(T) f V f(s-u) e ~ At e~" rt dt dr + 

r=o t=o 


=: F(s) 


00 


+ 


Jq(0 

0 


1 


u! 


( A+r ) r w«_ 


l(s ~ u + D,0)dr + 


n-o 


00 

+ y ■^-e“^ +r ^ r l(D,0)dr . 

O _ . H 


u=s+l 


(34.6) 


For y > s, the following consideration applies: The average waiting time of the system 

1 

in state y is j. For this time, the inventory cost hy is incurred. Inventory then drops 
to y — 1. The resulting costs l(y—1,0) are discounted by the factor e~ r ^ 
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1 (y>0) = j^ + pl(y-l,0) ; y > s . 


(34.7) 


Special Case: Internal Production or Just-fa—Time Delivery 

If we produce on our own, the supply quantity is under our control. We assume that 
the good is expedited from the finished goods inventory to the sales warehouse. 

Delivery time arises because the order must line up in a queue of already existing 
delivery orders and, therefore, remains unprocessed for a time. Hence, the amount still 
to be delivered can be changed at the last minute. In this case, the order quantity can 
can always be updated up to last moment just before delivery such that inventory is 
filled up to y = S upon the arrival of an order. Under this assumption, the equation 
(34.6) changes to 

l(s,0) = k + aD + F(s) + 1 (S,0) Jq(r)e r dr . (34.8) 

o 

v -v-' 

= : a 


a : Expected value of the discount factor over the delivery time 

Equation (34.7) remains unchanged. This applies especially for y = S 

S 

l(S,0)=j J y + /?^l(s,0) . (34.9) 

y=s+l 


Substituting l(s,0) from equation (34.8) and evaluating the sum, one obtains 


1(S) = 


1 

: it 

1 —pa 


h _Hs+il] +/ ,S-S[ k + aD + F ( s )] 


(34.10) 


The second argument r = 0 in the cost function 1 is no longer carried because the 
recursions (34.9) and (34.10) are always related to r = 0. 
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It is also not possible here to express formulas for the optimal values s*, S*. One 
obtains them by minimizing (34.10) 

-V(l +/J S - s [k + aD + F(s)]}-Min , 

1 —pa s,S 

whereby one can restrict s and S to integer values. 


§35 POISSON DEMAND, STOCHASTIC DELIVERY TIME, 

MULTIPLE ORDERS 

We now extend the inventory model to the case where a new order may be placed even 
before an existing order is delivered. We first the treat the case: 


Delivery Time r is Exponentially Distributed 
The delivery time distribution has the density 

q(r)dr = //e ^ r dr . 

The exponential distribution has the advantage that one need not know how old a 
pending delivery is. We assume that all delivery times are identically distributed 
(same supplier). 

Due to the Poisson demand, all orders are stochastically independent of each other. 
Because of this reason, the probability of an order occurring is 

a) in case one order is pending: n At 

b) in case m orders are pending: m/zAt 
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The number 


m: Pending orders 


is the second state variable in this inventory model besides the stock y. Hence, we 
need not add the pending orders to the inventory. 

y : Physical inventory (or shortage) 


The inventory and shortage costs are 


v(.y) = 


hy, f° r y ^ 0; ( ) , 

-gy, for y < 0; 


hy, for y > 0; 
—GA, for y < 0. 


(35.1) 


Each single order has a lot size D. The principle of optimality without discounting is 
given by 

l(y,m) + CAt = ^(y)At + m/jAtl (y+D,m—1) + [1 — m//At — AAt] l(y,m) + 

AAt Min {k + aD + l(y—l ,m+l) | l(y—l,m)} (35.2) 

or after transposition, simplification and division by At 

(A + m//)l(y,m) + C = <p( y) + m//l(y + D,m — 1) + 

+ A Min {k + aD + 1 (y—1 ,m+l) | 1 (y—1 ,m)} . (35.3) 

This is a difficult differential equation. We defer, therefore, to a heuristic solution, 
e.g., in which we introduce equidistant order points s^, S 2 , ... 
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Figure 35.1: A number of equidistant order points 
The problem "determine M and D" can be solved with the help of state probabilities. 


Delivery Time r arbitrarily Distributed — No Overlapp in g 

Let the delivery time be arbitrarily distributed. No overlapping means what was 
ordered first arrives first. 

For a fixed delivery time r we used the concept of relating the inventory cost to the 
time period r. We will proceed here in the same manner. Since r is stochastic, the 
demand u within r becomes a random variable. Let 

p(u) : Probability that demand u occurs within the delivery time 

For a Poisson demand with rate A 

pO) = / ^ e ~^ r q(r)d7 . 

0 


(35.4) 
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Example: r is Gamma-distributed 


j +i jp-M 7- 

q(r)dr = £—-dr ; 

J * 


/ \ i 7 1 i \U Hn, \u+j+i u+i rt — (A+/x) j 

P( u ) = 77 su+j-t-f ^ uT jl^ J r J e v ^dr 


(A+/j) 


NB: |/? 1+1 r i e '^ r dr = i! 

0 


»W’ Is iT#(r47) , <rK) J * 1 


+ l*‘ 


- )(-??( 


(35.5) 


This is a negative binomial distribution with exponent -(j + 1) and probability 
p= A/(A+/i). A Poisson demand occurring within the delivery period, where r is 
gamma distributed, becomes an exponentially distributed random value. 

The expected value of the inventory and shortage costs f(y) in relation to the expected 
value of r is 


y oo 

f(y)=h j (y - u)p(u) + g J (u-y)p(u) . 
u=o u=y+l 


(35.6) 


The Principle of Optimality is 

l(y) + CAt = f (y)At + [1 - AAt] l(y) + 


+ AAt Min (k5(x-y+l) + a(x—y+1) +l(x)} . 
x>y-l 


(35.7) 



146 


CHAPTER 4 : STOCHASTIC MODELS WITH CONTINUOUS REVIEW 


It has now become simpler compared to (34.2). It is because all of the remaining 
orders are ignored. The expected costs depend only on the last order (no overlapping!). 
However, f has become more complicated. In the limiting case "fixed delivery time", 
q(r) is turned into an uncharacteristic distribution. 

Since (35.7) is identical to the principle of optimality (33.2), the minimization of the 
costs per cycle leads to the same formula as in the model with fixed delivery time (§33) 
but with another formula (35.5) for p(u). 


Example with discounting 

We consider in this example the discounted case. Let 



: Interest rate 
: Discount factor 
« (1 — rAt) for At < < 1. 


In the discounted case, the Principle of Optimality for small At is 

l(y) = f (y)At + (1 - A At) (1 - rAt)l(y) + 

+ AAt(l — rAt) Min {k<S(x—y+1) + a(x—y+1) + l(x)} . 

x>y-l 


If one ignores the terms of higher order in At, we have 

J (y) =ir? f (y) + 

■y~— Min {k<S(x — y + 1) + a(x — y + 1) + l(x)} (35.8) 

x>y-l 

This formulation is no different from the inventory model with periodic review 
(compare the Principle of Optimality principle in the formulation (36.4)). 


The next chapter is devoted to such models. 


CHAPTER 5: 

STOCHASTIC MODELS WITH PERIODIC REVIEW 


This chapter may be read independently of the previous chapters. 


§36 THE ARROW-HARRIS-MARSCHAK MODEL 

Continuous inventory monitoring is no longer a problem with the introduction of 
electronic data processing. Many firms, however, still maintain periodic inspection and 
decision. Sometimes it is due to arrangements made with suppliers which only allow 
an order at specific (often equidistant) points in time. Two successive possible points 
define a period. It is then superfluous to monitor the stock during the period since one 
does not have any use for this information. An inventory review (physical or book) at 
the start of each period is enough. 

Multi-period models with stochastic demand require a more flexible ordering rule than 
models with continuous monitoring. The lot size is chosen depending on the actual 
inventory. Hence, such models require dynamic programming (DP) as a method of 
solution. For a rigorous handling of this problem, dynamic programming is applied 
here for the first time. The following is, at the same time, an introduction to the 
thinking process of dynamic programming. (The Principle of Optimality, from which 
DP is based, was already formulated a number of times in the previous sections; a 
computational method of DP was already used in §25.) Some algorithms for stochastic 
DP are explained in Chapter 6. 

The basic model of inventory with periodic monitoring was formulated by KENNETH 
ARROW, TED HARRIS and JACOB MARSCHAK and is named AHM-Model after 
them. (ARROW & HARRIS & MARSCHAK (1951)). 

The following conditions apply: 

1. Periodic review and decision 

2. For all periods, demand is random, independent and identically distributed. 
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3. Unsatisfied demand at the end of each period 

a) is lost (LOST SALES) 

b) is backlogged (BACKORDER) 

4. Deliveries arrive immediately. Delivery time is zero. 

5. Finite or infinite planning horizon. 

6. Discounting 

Let 

p : Discount factor for one period 

n : Planning horizon 

y : Inventory at the start of a period immediately before a decision 

x : Inventory at the start of a period immediately after a decision 

x-y : Order quantity 

u : Demand, random variable 

P(u) Distribution function of demand with density P u du 

f(x) : Expected value of inventory and shortage costs for a period 
a : Proportional order cost 

k : Fixed order cost 

The cost function was previously denoted by 1. In dynamic programming, the value 
function is usually denoted by v. Since periodic models play a great role in DP, we 
assume the following notation (in place of 1 we now use v): 

v : Value function (previously 1) 

v Q : Costs incurred at the end of the planning horizon 

v Q = 0: By assumption (as long as no other value is set) 

The following figure shows the operational characteristic of inventory control using the 
so-called (s,S)-Policy (compare 39.1). The heavy lines show the inventory movement 
along time; the dashed lines show other possible inventory levels. 
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The value s is the order limit and S is the full inventory point. 

The (s, S) policy states: If y < s at the time of inspection, then place an order sufficient 
to raise inventory up to S. (Delivery times are ignored for a moment.) 

Inventory y 



The expected inventory and shortage costs with inventory cost rate h and shortage cost 
rate g are 


X oo 

f(x)=h J(x - u)dP(u) + g J(u — x)dP(u) = 
o x 

(§26) x 

= 0 + g) / P(u)du + g(/x-x) . 

0 


(36.1) 


H is the expected value of demand. 
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According to the BERNOULLI Principle, we choose as our objective function the 
expected value of all costs during the entire planning horizon. We divide these costs 
into the costs of the period immmediately prior to it (single period costs) and the costs 
of the remaining problem. The planning horizon is reduced by one period and the 
starting inventory for the remaining problem is identical to the inventory at the end of 
the first period. For 

v n (y) • Expected value of all costs with initial inventory y, planning horizon n 

and optimal inventory management 

the recursion applies (with given boundary condition v (y) = v Q = 0) 
a) in the BACKORDER case: n = 1, 2, 3, ... 


00 

v n (y) = Min {k<5(x - y) + a(x - y) + f(x) + p ) v j(x - u)dP(u)} 
_x>y_o_ 

(36.2) 


b) in the LOST SALES case: n = 1, 2, 3, ... 


v n (y) = Min {k<5(x - y) + a(x - y) + f(x) + p j v .(x - u)dP(u) + 
x>y o 

_ + [i - p ( x )l Vi(°)} • _ 

(36.3) 

The Theory of Dynamic Programming (RICHARD BELLMAN) was first used to 
develop the AHM-Model. Hence the above recursion method was called the 
BELLMAN Principle of Optimality and, both equations (36.2) and (36.3) the 
BELLMAN Functional Equations. 

The stock level immediately before a decision is given by y. One can also formulate 
the functional equations related to a state 
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: Inventory after a decision (BECKMANN (1968)): 


00 

v n ( z ) = f ( z ) + P S Min {k<5(x - y) + a(x - y) 

o x-y 

+ v n-1 (z + x-y-u)}dP(u) . (36.4) 

This form is advantageous for numerical purposes since the state space is smaller. 
However, it is not pursued here. 


§37 THE AHM-MODEL IN THE STATIONARY CASE 

The AHM Model in the stationary case is more amenable to an analytical treatment. 
Aside from this, the functional equations are simplified because the iteration index n is 
disregarded. 

Several times previously, we used the method for the stationary case which takes out 
the proportional ordering cost component of the cost function to be minimized. In the 
long run, the receipt of goods must be equal to the withdrawal of goods, i.e., the 
expected stationary mean order quantity in the BACKORDER case is equal to the 
expected value (i of demand in one period and hence independent of the ordering rule. 

We now want to use the same consideration for the stationary AHM Model. The value 
function (BACKORDER Case) is 


00 

v(y) = Min (k<5(x-y) + a(x-y) + f(x) +p j v(x - u)dP(u) } . (37.1) 

x>y o 

In order to consider all costs at their present value these are discounted by p < 1. 

The expected value of all proportional costs are now to be removed from v. As an 
approximation, it is assumed that at the start of each period the expected sales h of the 
previous period is ordered. Then the expected value of all proportional ordering costs 
is 
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A 

We substitute the adjusted value function v(y) 


v(y) := v(y) -a(^-^-y) 

in the functional equation (37.1) 


v (y) + a ( /_ 0 - y) = Min {k^(x — y) + a(x-y) + f (x) + p ) [v(x 

^ x>y o 

+ + u )l dp ( u )} 


Lu 

Min {k<5(x — y) + ax — pax + f (x) + ap — + pap - 

x>y . p 


=: f(x) 






ay + 




00 


+ p J v(x — u)dP(u)} 
0 


and obtain (in the BACKODER case) 


v(y) = Min {k<5(x - y) + f(x) 
x>y 


00 - 


+ P \ v(x 
0 


- u)dP(u)} 


with 


f (x) = ax(l — p) + f (x) . 


(37.2) 


(37.3) 


-u) 


(37.4) 


(37.5) 


In the LOST SALES case, this trick cannot be applied since the average order quantity 
per period is less than the expected demand. 
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In the following sections we will deal with two questions: 

1. How can one obtain a concrete solution for a single case? 

2. How does the structure of the optimal solution look like if one can, at all, speak 
of a structure? 

538 STANDARDIZATION 

The AHM Model can be standardized under certain assumptions. 

Condition CCll: 

The demand u has a probability distribution of the form P(u;//, a) = Q( u ~ which 
can be standardized. 

We then set 


U = H + (T€ (38.1) 

H : Expected value of demand 

a : Standard deviation of u 

t: Random variable with /r = 0, a = 1 (stochastic component of demand) 

q(^)=P(e)- 


Condition (C2): 

Let the fixed order cost k be of the form 

k = k Q <r . (38.2) 


The Principle of Optimality can be formulated in the following manner 
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v(y) = Min {k<5(x — y) + a(x — y) + h f (x-u)q( u - ^ )d(^) + 
x>y u=o 


VA/ 

+ g / (u - x)q(S-=-£)d(J)} + p /v(x - u)q(^)d(J). 

X 

(38.3) 

The right-4iand side becomes proportional to a under condition (C2). 
we perform the following variable transformations 

To show this, 

t -- x ~ H • 

^ a ’ 

(38.4) 

• • 

II 

<=* M 
• 

(38.5) 


In addition, de = d(^) and we define 

ov(T}) := v(y) . (38.6) 


Then from (38.3) 

<rv(r)) = Min {k <t 6(( - v) + a<x(£ - tj) + <rf (() + 

(>ri 0 

+ p f <rv(( — e)p(c) de} . (38.7) 

The factor a is cancelled and one obtains the STANDARDIZED EQUATION 


v{n) = Min {k + a(£-J?) + f(0 + 

(>V 


+ p J c)p(c)de} 


(38.8) 


The derivation shows: 

1. The expected inventory and shortage costs are proportional to the standard 
deviation of demand and independent of its expected value, v increases with 
increasing a. 
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2 . 


h K o 

Once and for all the standardized problem for different cost rates - and - is 

© © 

solved. With the help of the reverse transformation 


x = n + , 

the optimal policy of the given problem is derived from the optimal policy s*, 
S* and is given by: 

- w - 

s = [l + <rs ; 

* 

S = n + 0‘S 


The conditions (Cl) and (Cl), however, must be checked. 


§39 EXPONENTIALLY DISTRIBUTED DEMAND 
We attempt to hold on to our previous solution schema: 

1. The structure of the optimal ordering rule in parametric form is given; 

2. Derivation of the cost function c; 

3. Minimization of v with respect to the parameter of the ordering rule determines 
the optimal ordering rule. 

The objective will only be achieved if two conditions are fulfilled: 

- stationary model, 

— the optimal ordering rule has the assumed parametric structure. 

Again, we first postulate the structure of the ordering rule and optimize only within 
this structure. (It will be shown later that the global optimal ordering rule has the 
assumed structure.) 
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Assumption: Let the ordering rule be an (s, S) policy 


s < y < S: do nothing ; 
y < s: replenish up to S. 


(39.1) 


If the initial stock y > S, one waits until the stock has fallen down to S. The (s, S) 
policy then applies at that point and S is the maximum stock level. The value 
function, restricted to the class of the (s, S) policy, is (in the BACKORDER Case) 


A 



/s 00 

f (y) + p f v(y - u)dP(u) , for y > s ; 
o 


00 - 


k + f (S) + p J v(S 

o 


- u)dP(u) 


, for y < s . 


(39.2) 

(39.3) 


The proportional ordering costs are ignored in this model. It has been shown in §37 
that this is not a relevant restriction. We assume a continuous state space such that 
the two alternatives ("order" and "do not order") are both acceptable. The two 
equations (39.2) and (39.3) hold for y = s. 

For y < s, the value function (39.3) is independent of y. It applies specifically to 

A A 

v(y) = v(s) , y < s. 

Hence, (39.2) can also be written as 


v(y) = f(y) + p 



(y-u)dP(u) 


/S 

+ pv(s)[l 


P(y-s)] 


V > s . 

V - 


(39.4) 


/V /V /\ 

For y = s we have v(s) = f(s) + pv( s) or 


A 



(39.5) 
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In general, the preceding considerations still apply. We now attempt to solve the 
integral equation (39.4) for an exponentially distributed demand. 

We have, 


P(u) = 1 -e~ m 




p(u)du = ae~ m du, 

E M = p = J- 


The expected inventory and shortage costs are 

A 

f (x) = f (x) + ax(l — p) 

f(x) = (h + g)[x-^(l-e _ax )] + g(J-x) . (39.6) 

We substitute it in (39.4), perform a variable transformation £ = y — u in the integral 
and multiply the equation with This gives 

v(y)e ay = (h + g)[ye ay -i(e fty -l)] + & e fty + (a -g - p)ye ay + 

y - c 

+ ap f v(()e a ^ + a/?v(s)e ftS . (39.7) 

s 

Using the definition 

w(y) :=v(y)e ay , 

we have 

w (y) = (h + g)[ye fty -i(e fty -l)] + &e ay + (a-g-/>)ye ay + 

y 

+ op J w(£)d£ + op w(s) . 
s 
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By differentiation we convert this integral equation in a differential equation of the 
following form 

w'(y) - ap w(y) = [oy(h + a -p) + a-/>]e ay . 

With the integrating factor t~ apy , the differential equation becomes 

jjy(w(y)e _w ) = [oy(h + a - p) + (a-/?)]e a ^ ~ p ^ y . (39.8) 


We integrate (39.8) and again obtain the original cost function v by multiplying both 
sides by exp(apy — ay) . In the second step we substitute the boundary condition 

A 

(39.5) and get the required solution for v. 

In the third step, v is to be minimized with respect to y = S and s 


Min v(y) = v(S) =» S 

y 


A 

Min v(y) =» s . 

s 


However, the conditions gy = 0 and ^ = 0 necessary for the minimum lead to 
transcendental equations so that no explicit solutions can be given for s and S. 


To arrive at a solution we modify the cost structure. Let 

00 -t _ y 

f (x) = hx + g f (£ — x)e ^ d( = hx + ge 

x 


(39.9) 


In addition, let the expected value of demand - 1 which can be obtained by rescaling 
the units of demand without limitations. Then a = 1. Inventory cost is taken in 
relation to the start of the period and shortage cost at the end. Inventory y > 0 is 
therefore given a higher cost than before. Instead of (39.7) we now obtain 
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v(v) = hy + ge ^ ^—(hse s + g)e ^ + p f v(()e^ ^ d£ . (39.10) 

^ s 

The solution of this equation is 

C( y - eb »[— *fCL] e (M)(y-s). ( 39 . 11 ) 

“ (1 - 7? (1 -rt 2 ^ 

/V A 

v is convex in y and s. S = y is the optimal initial stock. Hence, the minimum of v lies 
at S 

v'(S) =0. (39.12) 

If one substitutes y = S and y = s in (39.2) and (39.3), respectively, and subtracts 
both expressions one gets 

v(s) —v(S) =k. (39.13) 

We use both these equations (39.12) and (39.13) to calculate s , S, and D = S — s. 

For y = s 

»'(S) = * (o - » e<p “ 11 (S " s) • 

1 p (i - py 1 p 

Since v'(S) = 0, it follows that 


D = S - s = | 1 . In [p + (1 — p) | e s ] . 


(39.14) 


From (39.13) we have 


-s 


+ gej__ hD_ 
T - p 1 - p 




(1 - P) 


(1 - P) 


(39.15) 
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|>+ (l-/») f e~ s ] - (1 - *>)D - e~ s + p]e^-^ 


s 0 r p )\ 

E ^ • 


(39.16) 


Using the shortened form 


9 ■= P + (1 -p) f e 


—s 


(39.14) becomes 



and (39.16) takes the form 



1 + 


II 


¥ 


-.2 

^-k 


+ In q 


(39.17) 


The values s and D are computed as follows: at first, q is determined iteratively from 
equation (39.17). Then s, S and D are given by 



D , T _ 7 .l nq ; 
S = s + D. 


(39.18) 

(39.19) 


(39.20) 
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§40 OPTIMALITY OF THE (s ,S )- POLICY 

The optimality of a (s, S) policy (S = s + D) was obvious for models with continuous 
stock review. Because of this, the analytical solution method was considerably 
simplified. Does it also apply for the AHM-Model? 

We consider the AHM Model in the backorder case. The Principle of Optimality is 
given by 


00 

v (y) = Min {k«5(x-y) +a(x-y) +f(x) + p f v (x - u)dP(u)}, (40.1) 

x>y 0 n 1 

u = 1,2...,N. 

By evaluating this recursion, a solution can always be found. One begins with a 
starting value v Q defined from the problem and computes in sequence the chain 

v o 4 V 1 4 v 2 * • • • 4 v n 4 • • • * V N • 

One calls this the VALUE ITERATION of dynamic programming. A separate 
optimal ordering rule is obtained for each period. For an infinite planning horizon a 
suitable termination criterion is still to be defined. (For a stationary model, however, 
one switches to another method.) Value iteration requires a great deal of numerical 
computation. For example, if y varies between —1000 and 1000, then the minimization 
operation must be performed 2001 times for each n. For a single minimization step, the 
right-hand side of (40.1) is evaluated 1000 times on the average. 

The optimal ordering rule can considerably be simplified for a given structure. For a 
( s n , S n ) structure, for example, one first takes the minimum around y = —1000. One 

knows that an order will certainly be placed. This minimization step gives s . 
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For all remaining stocks y = -999, -998, ... mi ni mization is reduced to a comparison 
between the two alternatives " do not order" and " fill inventory up to S n ". As soon as 

s is known, minimization is dropped for the remaining y values y = s Q + p s n+ 2 , ..., 
S n- 


With this viewpoint, the question "When is a (s n ,S n ) policy optimal?” gains 
importance. It should be examined in the following manner: 

We rewrite (40.1) in order to formulate the problem more transparently. We remove 
the cost -ay from the minimization and obtain 

v (y) = —ay + Min {k<5(x - y) + ax + f( x ) + P \ v i (x — u)dP(u)}. (40.2) 

XI v XIX 


=: H (x) 
n v ' 


We now split the functional equation into two alternatives, (I) and (II) 



-ay + i 


H „(y) • 

k + Min H n (x), 


if no order (I); 
if order (II); 


and obtain the decision rule: 


in case, 


H n (y) — Min H q (x) =:AH n 
x>y 


< k 4 do nothing ; 
> k 4 order x-y; 


(40.3) 


(40.4) 


For AH n = k, both alternatives are equally acceptable and we may choose either one. 

A physical interpretation is offered in the following manner. For the inventory y to be 
affected by external intervention (order), it is necessary to overcome the friction by 
exerting a certain amount of force k. This would only pay off if k is less than the 
amount of potential energy AH to be released. 
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Force with energy k 



Figure 40.1: Physical interpretation: 

To overcome its friction, the point mass y 
experiences a momentum k and slides to x*. Hence, 
a potential energy AH is released. 

The quantity y which makes it worthwhile to order depends on the form of H . Two 

examples are shown in Fig. 40.2 and 40.3. Each of the order areas are denoted with 
hatched lines. The global minimum of H at S Q determines - without considering a 

possible higher initial stock y — the maximum inventory. 



Figure 40.2: (s ,S n ) policy 
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Figure 40.3: A more complicated policy 
This consideration makes two things clear: 

1. If k = 0, one increases the inventory to the optimal value at each stock review 
even if the deviation from this value is small. 

2. If H is convex, then the optimal policy has a (s n ,S n ) structure. 


The convexity assumption of H , however, is not suitable because, firstly, it is more 
restrictive than necessary (see Fig. 40.2; H is not convex; however, the optimal policy 
is of the (s , S Q ) type) and, secondly, the convexity is not passed on to H n + One 
sees this in the next example. 


For n = 1, 



= -ay + 


H^y) 

.k + ^(Sj) 


(I) 

(II) 
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If f(x) is convex => H, = ay + f(y) is also convex. 

For example, the following graph of v, is given below 



S 1 S 2 

Figure 40.4: Example for a non-convex but a k-convex v^ 


vj© is convex, v^© is a linear function with slope -a. For some values of a, is 
not convex for all y. Hence, 

H 2 (y) =ay + f(y) + /> J v 1 (y-u)dP(u) 
is also no longer convex for all y. 

However, one achieves the objective if the concept of convexity is generalized in a 
suitable way. 
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Def. 40,1: A function H(y) defined along the real interval [y ;y + ] is called 

k-convex if for all y c [y ;y + ] and for any a, k > 0: 

H(y + a) -H(y) - aH'(y) + k > 0. 

Def. 40.2: A non-differentiable function H(y) defined along the real interval 

[y~;y + ] is called k—convex if for all y e [y~;y + ], 0 > 0 and for any 
a, k > 0: 

H(y + a) - H(y) + ^ H (y) ~ Ml Z-A ] + k > 0 . 


At first, it is to be shown that the optimal ordering rule has a (s n ,S n ) structure for a 
fixed n if H fl (y) is k-convex. As one sees from Fig. 40.2, there exists an optimal policy 
exactly of the type (s n ,S n ) only if H n (y) to the left of y = s q never falls below the level 
H n (S n ) + k or it reaches the following: 



' < k + H (S T, for < y < 
n v ir ’ n J 

* 

> k + H (S ), for y < s 
l n v n y 5 J n 



(40.5) 


This corresponds exactly to the decision rule (40.4). The condition (40.5) is satisfied 
with certainty if H fl (y) falls monotonically for y < s , i.e., if 


H' (y) < 0 for y < s . 
n w ' J - n 


(40.6) 


The k-convex functions H Q (y) satisfy this requirement (40.6). We shall prove this by 
contradiction. We assume the opposite: 


The k-convex function H fl (y) has to the left of s n a relative maximum H n (yp, y^ < 
s n’ H n^ y P > k + ^ n (S n ) (see the following Figure (40.5)) 
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Hn(y) 



Figure 40.5: Proof of the optimality of the (s_,S ) policy if H n is k-convex 


Let s be the largest y value for which H (y) exceeds the level H(s Q ) = H(s Q )+k (in 
this case, H'(s ) < 0). Because of the relative maximum at y j, H Q cannot be 
k—convex since the defining inequality is not satisfied. If we choose S = y^ + a, then 
it follows that 


H 


n< S n> 


- »„(y 

n w 


1 


aH'(y 

n v ' 


1 


) +k < 0 



This is a contradiction to the condition of k-convexity of H . Hence, for y < s Q , a 

k-convex function H can not have a relative extremum. Since H'(s„) < 0, H (y) is 

n n v n 7 n KJ ' 

therefore monotonically decreasing for all y < s , i.e., the (s , S ) policy is optimal. 
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The following steps show that the k-convexity of H n is passed on to H ^ 

Statement 40.1: The basis is the model described above. Let H^(y) be k-convex. 
Then H fl (y) is k-convex for all n e M. 


Proof: 


n = 1: Hj(y) is k-convex by assumption, 

n > 1: Let H n (y) be k-convex. 

4 (s n , S n ) policy is optimal, i.e., we can use the (s , S ) policy to 
compute for v (y). Then 



v Q (y) = -ay + 


H n (y) , 


k + W’ 


for y > s Q ; 
for y < s n . 


We compute v (y). 

1- s n < y < v n (y) = -ay + H fl (y) is k-convex. 
2. y < s n < y + a < S n : It must be shown that 


v n (y + a) - v n (y) - av^(y) + k > 0 . 

Substituting (*) gives 

-a(y + a) + H n (y + a) + ay - k - H n (S n ) + oa + k > 0 
H n (y + a) - H n (S n ) > 0. 

This is always correct since H (S ) = Min H (y). 

y ! ' 
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3. y < y + d < s n : v n (y) = -ay + k + H Q (S n ) is linear, 


therefore, also k-convex. 


4 v (y) is k-convex 


00 


4 / v n (x - u)dP(u) is k—convex 
o 

4 H n + ^(y) = -ay + f(x) 4 - f v n (x-u)dP(u) is k-convex, since ay 
and f(x) are convex. 


Since H^(x) = ax + f(x) is k—convex, it was therefore shown that 


The AHM Model (BACKORDER Case) developed in §36 has 
an optimal ordering rule of a (s n ,S ) policy type 

for each period. 


The important point here is the form of the expected inventory and shortage cost. 
Again it is given as 


00 


f(y) = h I (y -u)dP(u) + g / (u -y)dP(u) . 
o y 


The inventory level as well as the shortage quantities are covered by proportional costs 
h and g, respectively. Originally, ARROW, HARRIS, MARSCHAK, KARLIN, 
SCARF, BECKMANN and others worked with shortage costs which were equally high 
(tailored to the situation in the Navy, compare §26.2). Inevitably, all efforts to arrive 
at optimal (s , S ) policies failed. 


Later, numerous generalizations of the above function f(y) were given, but the (s , S ) 
policies still remained among them, (e.g., VEINOTT (1966), SCHAL (1976)). 
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541 ELIMINATION OF PROPORTIONAL ORDERING COSTS 
WITH FINITE PLANNING HORIZON 

We attempt to integrate the proportional ordering cost a(x - y) into the inventory and 
shortage costs. That would be possible immediately if the proportional ordering cost 
was a linear function of the inventory x as in the case with the inventory and shortage 
costs, hx and -gx, respectively. To achieve this, we choose the approach: 

Formula: v Q (y) = v Q (y) + ay - a/i, \i = E{u} . 


The proportional ordering cost for v fl (y) is now 


a(x - y) + ay — a/i. (41.1) 


Idea: The term ay is cancelled and ax remains, a/i is a constant correction term. 
With this approach we formulate the Principle of Optimality. The functions v fl (y) 

follow that of equation (36.2). We have 


v (y) = Min {k<$(x - y) + ax - ay + f (x) + 
x>y 

A 

+ p f [v n _|(x — u) — a(x — u) + a/i]dP(u)} + ay — a/i 


= Min {k<5(x - y) + a(l — p) (x — /i) + f (x) + p f v _ 1 (x — u)dP(u)} + /?a/i 
x>y n 1 


A term pa/i still remains. To be able to eliminate this term we extend this approach to 
include a variable correction term, 


extended formulation: v n (y) = v n (y) + a (y “ I*) + b n > (41.2) 


and return to the functional equation (36.2): 
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v (y) = Min {k<5(x-y) + a(x-/i) + b n + f(x) 
x>y 


A 

+ P f [ v n _i( x ~«) -a(x-u) + a/i-b n _ 1 ]dP(u)} 


= Min {k<5(x — y) + a(l — p) (x — p) + f (x) 


A 

+ P I v n _i( x - u )dP(u)} + b n + p&n - pb^ 


! 

= 0 


The remaining term b n + pafi — pb n _^ must be eliminated. Hence it must follow that 


b n = /?( b n _ 1 - a /*) 

= P l> b n _2 “ & P) 
- p n b Q — pa/x y~ 


- a n\ 


n 


We choose as initial value b Q = 0 and obtain 



(41.3) 


Hence the remaining term is eliminated. 

In the last step we integrate the order cost in the cost function f(x). Let the new 

A 

function be f(x) 

A 

f (x) := a(l — p) (x — p) + f (x) . 


(41.4) 
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Previously, it was 

f(x) = (h + g) f P(u)du + g(/*-x) . 

0 


Hence, 

f(x) = (h + g) f P(u)du + [g - a(l - p)] (fi-x) . (41.5) 

o '- v -' 

A 

= •• g 

We define 


A 


h := h + a(l — p ) ; 

(41.6) 

g := g — a(l — p) 

(41.7) 


as the new cost rates for the inventory and shortage quantities, repectively, and obtain 
the results 


f- ~(x) = f(x) 



(41.8) 



(41.9) 


^ 00 ~ 

v n (y) = Min {k<5(x - y) + f(x) + p j v^x - u)dP(u)} . 
x>y o 


(41.10) 
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Comparison with the elimination of proportional ordering cost 
in the stationary model (§371 

The above method can be extended to the case n = oo. 


For n ^ oo, (41.3) becomes 


limb =: b = ^~ 
n 1-/9 


n-bx> 


5 


and from (41.2) 


v(y) v(y) + a(y -//) + b 
= v(y) + ay-y^— . 


For the stationary model in §(37) we used a transformation 


(41.11) 


(41.12) 


v(y) := v(y) + ay-j^^ . 


There it was 



: = f (x) + a(l — p)x . 


The difference between the two formulations is that in the stationary model the 
average ordering cost a fi is always taken at the end of the period so that it can be 
discounted. Therefore, the constant correction term is 



The formulation chosen in this section uses the ordering cost at the start of the period. 
Hence the correction term here is 
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Finally, the difference in the total cost between both models is that in the latter model, 
the average ordering cost a/j occurs more often, namely to stock up for the first period. 
This is also shown by the following formal consideration. 

A comparison of both transformations shows 


/s 


Av := v(y) - v(y) = a/t ; 


Af := f (y) - f (y) = a//(l - p) . 

The difference Av of the value functions can be explained completely by the different 
single period costs. It is, namely, the present value of the differences Af for all periods 

00 

Af ^ p n = a/z 
n=o 


which is exactly the difference Av. 

The advantage of the formulation chosen in this section is that the elimination of the 
proportional ordering cost can be interpreted as a modification of the inventory and 
shortage cost rates h and g. 


§42 BOUNDS FOR (s n , S n ) 

Bounds for the (s n , S Q ) policies will be derived in this section. 

s < s < s < S < S < S . (42.1) 

This is useful for numerical purposes. One can limit oneself to x values in a minimum 

search Min { } which lie within the intervals [s, s) and [S, S), respectively. Besides, 
x >y 
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the bounds are used for the proof of an optimal (s, S) policy for the stationary model. 
More of this later. 


1. Bound S: 

The replenishment point of the single period model will be a lower limit S for the 
replenishment point S n of all multiple period models, n > 1. 

A plausible justification for this is as follows: 

Since the fixed cost k > 0, in a multiple period model one will stock up for the future 
at the start of the first period. In a single period model stocking up for succeeding 
periods is not necessary. A rigorous proof will be given to show that this assumption is 
correct. It is already shown that for all periods a (s n , S fl ) policy is optimal, i.e., 

v n (y) = - a y 

We derive the proof of the model with proportional cost, v (y) is differentiable except 
at the point y = s . 

For n = 1 


+ 


, for y > s n ; 


+ k + H (SJ , for y < s„ . 


(42.2) 


H^y) = ay + f(y) (42.3) 

= :G(y) 

G(y) is the single period cost. It is independent of n. Take as its minimum at 
point S., i.e., 


G'(y) < 0 for all y < S^. 


(42.4) 
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We now show by induction: H^(y) <0 for all y < S^, n e IN. 

Hj(y) <0 for all y < was already shown. 

Let H^(y) < 0 for all y < S y Then from (42.2) 

f-a + H'(y) 1 

v^(y) = • < — a < 0 for all y < . 

—a 

With 

H' +1 (y) = G'(y) +/> J v'(x-u)dP(u) 
one obtains 

H^ +1 (y) - G '(y) ~p & < 0 for all y < Sj. (42.5) 

Hence H Q (y) assumes its minimum at y > for all n 6 IN. The minimizing value of H 
is the replenishment point in the n LI period counting backwards (i.e., the first 
period of an n-period model). It is therefore 

S 1 < S„ for all n E IN. 

1 ~ n 


(Bound due to IGLEHART (1963)) 
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Computation of S^: 


Hj (S^) = Min (x) <=» ax + f(x)->Min 
1 x x 


*=* ax + (h + g) J P(u)du + g(n — x) -> Min 

o x 

<=» a+ (h + g)P(x) - g = 0 


S 1 = P ■ 


The bound S, however, can still be improved if one uses the model with the 
proportional ordering cost eliminated. We have 


(y) = f (y) 


v n (y) = 


fH n (y) 

o 


< 0 for all y < 


is the minimizer of H^(y) = f(y). Following the schema of the proof given above 
one obtains 

H n (y) < f' (y) <0 for all y < Sj . 


Therefore S-, is also a lower bound for S„ 
1 n 


S 1 < S , for all n € IN. 
1 - n 7 


(Bound due to VEINOTT.) 
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/\ 

Sp however, is stricter than as shown by the following computation: 

HjtSj) =MinH 1 (x) <=> f(x) ->Min 
x x 

& f (x) + a(l — p) (x — n) -> Min 

x 

<=> (h + g)P(x) - g + a(l - p) = 0 

(42.6) 

/\ 

Since the distribution function P(u) of demand always increases monotonically, > 
S^. The advantage of eliminating the proportional ordering cost as proposed by 
BECKMANN is evident here (compare §36). 

/\ 

This fact is graphically represented in Fig. 42.1. The function f has its minimum 
farther to the right than f, but falls flatter to the left than f. 

(Reason: g = g — a(l — p) < g, i.e., the shortage cost is lower.) 




Figure 42.1: Lower bound S 
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H falls to the left of S, steeper than f. 


Since it only depends on the height differences AH and Af, respectively, H and f are 

/\ A 

moved in the above figure such that its minimum lies at equal height with f(Sj). 

AAA 

Then H lies over f for all y < S v Also, H lies over f for all y < S*. 
n •'i n J l 


2. Boundary s: 

We write the Principle of Optimality in the form 


v 


n 


(y) = Min {H (y) k + Min H (x 

x>y 



and first show that 

A A 

v n ( y ) < v n ( y/ ) + k > for y < y' . 


(42.7) 


We have 


v (y) < k + Min H (x) < k + Min H (y') < k + v (y') for y < y' 

li \ XL v , 11 11 

x>y x>y 


With the help of this inequality we obtain 


H n ( y )- H n ( y 0 = f(y)-f(y') +/? 1 [ v n _i(y - u ) - v n _i( y ' - u)]dP(u) 


<T 




for y < y', hence 


H n ( y ) -H n (y') < f(y) -f(y') + pk, y < r ■ 


We set y = s tf y'= S Q : 


(42.8) 
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= k, i.e. order 
k(l — P) < f(s n ) -f( s n ) • 

Hence we order: 


f (s n ) > f(s n ) + (1 - p )k 


(42.9) 


The above given inequality due to VEINOTT makes it possible to attribute the 
relationship between s n and S n , given by the result AH n = k of the recursive function 

/V 

H n , to the difference Af = (1 — pk) resulting from n independent costs f. 

A A 

AH = k 4 Af = (1 — p)k for . 

n \ r/ n’ n 

Hence an upper bound s is determined with given S from (42.9) 


s is the smallest number < S for which: 
f(s) < f(S) + (1 - p )k . 


(42.10) 


The following figure shows this: 
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Figure 42.2: Boundary s 


Since (1 — /))k > 0 in the discounted case and f'(y) <0 for all y < S, one also obtains 
the result 


s < S . 


(42.11) 


3. Boundary for s: 

/\ /v 

Consider H'(y) < f'(y) for all y < S, n g IN (compare (42.5)). From this it follows that 


H n (y)-H n (S)> f (y)-f(S) , y<S, 


A. 

i.e., the level difference k is created if the interval [y,SJ relative to H n is smaller than 


A 

that relative to f. This is evident in Fig. 42.3: 
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Figure 42.3: Lower bound s 

Hence, from the above diagram, s is the lower boundary for s 


s is the smallest number < S for which: 
f(s) < f(S) + k . 


(42.12) 


4. Bound S: 

rv /V 

Initially an upper bound S for can be derived from: f(S) = f(S) + k. A more precise 
bound, however, can still be found. Since the interval [s, s] is defined by the difference 

/V 

pk with respect to f, the interval [S, 5] must also be determined according to this 

A. A A 

difference with respect to f. Hence we obtain 5 from: f(5) = f(S) + pk. 
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S’ is the smallest number > S for which: 


f(S) > f(S) + pk . 


(42.13) 


Summary: 

The parameters s n , S n are defined over the functions H^: 


H (S ) =MinH (x) => S , 
n v n' n v ' n 5 

x 


H (sj = H (Si + k ^ 
n v n y n v n' 



Thus we order for all y < S n which satisfy the inequality 

H (y) —H (S ) >k . 
n u; n v n y 

/V 

The recursively defined functions H , however, pose some difficulties. It is possible to 
estimate specific values of s Q , S , which were previously determined from the 

/v 

differences of H n , from the differences in the single period cost f. With this, the 
bounds s, s, S, and 5 can be determined: 


s = smallest whole number for which: f(s) < f(S) + k ; 

A A 

s = smallest whole number < S for which: f(s) < f(S) + k(l — p ); 

A A 

S = smallest whole number which minimizes f(S) = Min f(y); 

, „ y 

5 = smallest whole number > S for which: f(5) > f(S) + pk. 
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The graphical representation of these results are shown as follows: 


f(y) 



Figure 42.4: Boundaries of the (s n ,S n ) policy 

As a by-product, the proof also gives the following results 

s < S, 

f( s n ) > f(S n ) + (1 —p)k. 


Remark: From the proof, it was assumed that a single minimum exists for f and f (at 

A 

points and Sj, respectively) and that the functions left of the minimum fall 

monotonically and, to the right of it, increase monotonically. Functions of this type 
are called unimodal functions. 
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f(x) 



Figure 42.5: Graph of a unimodal function f(x) 

with a minimum as extreme value 

Thus the above results apply not only for models with convex single period costs 
(special case of unimodal) but also, in general, to unimodal costs. Strictly speaking 
inventory costs are not proportional to the quantity y. Only the interest cost is 
proportional to it. The handling cost, e.g. acquisition cost, increases at a decreasing 
rate with a more efficient inventory organization. The typical movement of the 
inventory cost now looks as follows: 

a(y) 

A 


► y 



Figure 42.6: Returns to scale in inventory cost 
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This leads to a non—convex but unimodal function of the expected value of inventory 
and shortage cost f. 

In the original work by VEINOTT and WAGNER, the derivations of the bounds for 
s . S in an AHM Model for the discounted and undiscounted case (p = 1) as well as 

for the case with constant delivery time are performed. The same equations always 
apply for the bounds. 


§43 OPTIMALITY OF THE (s, S)-POLICY IN THE STATIONARY MODEL 

With the results obtained from the last section it is now possible to prove that a (s, S) 
policy is optimal for the stationary AHM Model with fixed costs. For the AHM Model 
without fixed ordering costs, one computes the optimal ordering rule of the (S) policy 
type by minimizing the single period cost. Since this does not depend on the planning 
horizon n, the (S) policy is also the optimal policy in the stationary model. 

In an AHM Model with fixed ordering cost, the optimal ordering rule of the (s n , S n ) 

policy type depends on the number of periods. Therefore it is more difficult to prove 
that a (s, S) policy is also optimal in the stationary case. We want to mention at this 
point only the essential steps (the proof is based on BANACH’s fixed point theorem. 
See COLLATZ (1968)). 

1. The limit v(y) := lim v (y) exists: the sequence (v } converges equally at 

n->oo 

each finite interval. It follows that 

max {I v n +i(y) _ v n (y)I} - p max 7T {l v n (y)- v n _i(y)l} (43.i) 

s<y<S s<y<S 

for all n e IN (CONTRACTION CONDITION) and, therefore, also 

max {I v n+ i (y) ~ v n (y)l} < / jnax {|v 1 (y) j> . (43.2) 

s<y<S s<y<S 
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It is enough to restrict the maximization within the area s < y < 5 since for 
y <s, v (y) is linear with slope -a for all n and y > 5 cannot occur in the 
stationary case. 

2. Under the condition v Q = 0, {v n ) is a monotonically increasing sequence, i.e., 

Vl(y)> v n(y) for all y, n . (43.3) 


3. The function v(y) satisfies the Principle of Optimality. We have 

00 

L(x,y,v) :=a(x-y) +f(x) + p jv(x-u)dP(u) 

o 

the right-hand side of BELLMAN’S functional equation. Because of the 
monotonicity of the sequence (v } 


v n (y) =Min 
x>y 


{ L ( x >y> v n _i)} < Min {L(x,y,v)} . 

x>y 


Hence, for n -* oo 


v (y) < Min L(x,y,v) . 
x>y 


On the other hand, due to the monotonicity of (v } it follows that 


v (y) > Min L(x,y,v n ) 
?>x>y 


> Min {lim L(x,y,v n } . 
?>x>y n "°° 


(43.4) 
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Using the LEBESQUE theorem of monotonic convergence, one may bring the 
limit value under the integral and obtain 


v(y) > Min {L(x,y,v)} = Min {L(x,y,v)} . (43.5) 

5>x>y X ^ y 


Together with (43.4) we obtain 


v (y) = Min (L(x,y,v)) . (43.6) 

x>y 


4. v(y) is the unique solution of the BELLMAN functional equation (43.6). 


5. The sequences (s n ), (S n ) are restricted within [s, s], [S, 5]. These contain 

convergent partial sequences. Each limit s and S of these partial sequences 
describes an optimal ordering rule for the stationary model. 


§44 A METHOD FOR COMPUTING s AND S 


Discounted Case 

We write the Principle of Optimality in the form: 


/x 00 /\ 

f(y) + P \ v (y - u)dP(u), for y > s; 
0 

A 00 A 

k + f (S ) + p j v(S - u)dP(u), for y < s. 

o 


The functional equation defined for v can be derived iteratively as follows (let the 
initial inventory be S): 


v(S) = Expected Cost + Expected Cost + Expected Cost + ... . 
in Period 1 in Period 2 in Period 3 
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As long as the stock does not drop to nor below s, i.e., as long as the accumulated 
demand does not reach nor exceed the amount D, only the expected inventory and 

A 

shortage costs f are incurred. 


The fixed ordering cost k and the subsequent costs v(S) are incurred at y < s. Let 
p x ^ : Probability that the demand u = x occurs exactly within n periods; 

p( n )(x): Distribution function of demand in n periods, i.e., the probability that 

the demand u < x occurs at most within n periods. 

P(n) is the n-fold convolution of P 

p( n )(D) = f P(D-u)dp( n-1 )(u). 
o 


Furthermore, let 


q( d )(D): 


Probability that the accumulated demand reaches at least the amount D 
exactly in the (n + l)th period 


D 


q( n >(D) - / [1 -P(D -u)]dP (n ^(u) 


0 


(44.1) 


Then 


D - 2 D ~ 


( 2 ) 


v(S) = f (S) + p f f (S — u)dP(u) + p j f (S — n)dP^ ' (u) + ... 

o o 

S- - ' 

y stays above s in Periods 1,2,3,... 


(i) 


+ p [k + v(S)] ; (D) + /[k + v(S)]Q v i (D) + .. 


( 2 ) 


■*v 


y < s in Periods 1,2,3,... 
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Using the following definition for the null convolution 


du : = 


u 


1 du, for u = 0 ; 
0 , for u > 0 ; 


we have 


D 


f(S) = />°]f(S)dP (o) u , 
0 


and obtain 


00 


D 


V (S) = ^ P n I f(S-u)dp( n )(u) + [k + v (S)] £pV u )(D) . 
11=0 0 


Under the assumption that it will be ordered before shortage occurs, i.e., 


v( 0 ) = k + v(S) 


(44.3) can be solved for v(0), 


00 D 

k + ^ p n f f(S - u)dp( n ^(u) 


v (0) = 


n=o 


o 


00 


1 - lA (n) (D) 

n=l 


We now reduce the terms Q^ d \d) in the denominator, n = 1, 2, : 
distribution functions P^ q \d). From (44.1) we have 


q( n )(D) - |dp( n ^(u)-/P(D-u)dP^ n 1 ^(u) 


o 


0 


= p( n ! )(D) -pW(D) . 


(44.2) 


(44.3) 


..., into the 
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Therefore the denominator becomes 

00 00 

1 - J (D) = 1 - J / +1 [p( n ) (D) - p( n+1 ) (D)] 

n=l n=o 

00 

= 1 -/9p(°)(D) - l P^(D) (/ +1 -/> n ) 

n=l 

00 

= 1 -/>p(°)(D) + (1 -p) J p n ?^(d) 

n=l 

00 

= (1-/0 l /p (n) (D) , 

n=o 

and for v(0) we obtain 


(44.4) 


Minimization of the objective function 

A 

Min v Q n (0) 

S,D 



results in the optimal values S and D, s = S — D, of the stationary (s, S) policy. 



192 


CHAPTER 5: STOCHASTIC MODELS WITH PERIODIC REVIEW 


I Inriiscnunted Case 

A. A 

In the undiscounted case, v := 1 im v n is known to be unbounded. The average cost 

n->oo 

per period, C or c, now enters as an optimization criterion instead of the present value 
of total cost if one ignores the proportional ordering cost. In §21, it was shown that 
the following relationship arises between these two criteria (21.3) 


c = lira (1 -p)v 
/>->! H 


5 


C = lim (1 -p)v . 
/)->! H 


Applying this to the objective function v^ in (44.4) one obtains the cost criterion for 
the undiscounted case 



(44.5) 


Undiscounted Case with Discrete Demand 

Previously, if we had used average costs per period (i.e., per unit time) in the 
stationary case, then the state probabilities were always considered. We again have a 
similar case. However, it is not obvious from the above objective function (44.5). We 
can reformulate it in the case of discrete demand so that only the state probabilites 
arise instead of the convolutions. 


Firstly, (44.5) is formulated for the discrete case. Let 



: Probability that the demand is u over n periods; 
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p( n )(D) : Probability that the demand is smaller than D over n periods. 


With these expressions, (44.5) becomes 


D—1 


00 


k + l f ( S -*) l Pj 


00 


c = 


u = o 


n=o 


00 


(44.6) 


J p( n )(D) 

n=0 


The state probabilities 7r 


y 


are now to be determined. 



Probability that the initial stock has the value y in the stationary case 


After placing an order, the initial stock between s and S may lie in the range 
s + 1 < y < S. 


We first consider the situation y = S. It can only exist if either the initial stock in the 
previous period was already S and no demand occurred 


T S p o ’ 

or demand has occurred in the previous period and because of it the stock fell to or 
below s; in this case, the probability is 

S 

J X y [! — p (y — s)] - 

y=s+l 

Since both of these events are independent of each other, the state probability ttq is the 
sum of both probabilities above 
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S 

T s = T s p o + l ^yC 1 - p (y-s)]• 

y=s+l 


(44.7) 


We now calculate the state probabilities ir , y < S. 


We assume that S — s > 2. An initial stock y < S may exist only in this case. We 
obtain the initial stock y < S from the initial stock y + u of the previous period and 
the demand u. 

S 

T y = S T i p i-y ’ for s < y < S • (44.8) 

i=y 


The solution of these equations (44.7) and (44.8) is obtained by the following 
consideration. The state y < S is reached from S for the first time after n periods with 
probability 


P 



for 1 < y < s . 


Hence, 

00 

T y = T S 1 p «!-y ’ for s < y < S . (44.9) 

n=l 

With 


1, for u = 0 ; 

0, for u > 0 ; 

the summation can start at n = 0 and using the normalizing condition 




INVENTORY CONTROL: MODELS AND METHODS 


195 


S 



y=s+l 


we obtain the solution 



, for u = 0 


s. 


(44.10) 


so that Tq can simply be written as 

1 1 
I — P 00 

J p( n )(D) 

n=0 


(44.11) 


If one substitutes both of these results in the objective function (44.6), one obtains the 
formula 


S ' 

c = k(l - P 0 )t s + l i f(y) 
_ y=s+l _ 


(44.12) 
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For a geometric demand distribution 


P U = QP U > 0 < p < 1 , q = 1 — p , ueM 0 . 


The generating function (compare §19) is 



and the generating function of the n-fold convolution is 

We now want to compute the state probabilities. By definition 

00 

l pWx" . [«(*)]" . 

u=o 


If one takes the summation over all convolutions, n = 1, 2, 3, ... with this equation, 
then one obtains 

00 00 00 

1 l ■ l [«<*)]* 

n=l u=o n=l 
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therefore 


00 00 


I U 

u=o n=l 


(n) x u 


■ l V 


u=o 


(44.13) 


Comparing coefficients, we have 


H 


w- 


n=l 


3* = constant 
P 


(44.14) 


Substituting into (44.9) results in 


T = Tq [1 — p ] ^ 

y S L F o J p 


x = Tgq = constant , s < y < S 


(44.15) 


From the normalizing condition 


S-l 


1 ~ T S = l 


y=s+l 


one obtains 


T S 'FT 


(44.16) 


T y - r +■(!)- TJq > s < y < S * 


(44.17) 


The objective function (44.12) then becomes 


c = 




s-i 


- i)q 


l i(y) 


y=s+l 
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/\ 

We define the inventory and shortage costs f as 

00 

f (y) - hy + g l (u-y)p u - (44.18) 

u=y+l 

The function f(y) is convex so that an optimal (s, S) policy remains guaranteed. 
However, it estimates the inventory cost in a most unfavorable way. 

With the geometric distribution we have 

y+1 

f(y) =hy+ S2—- . 

We substitute this value in the objective function and obtain 



/ 
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Determination of s: 

s is the smallest number for which the first difference of c in s is greater than or equal 
to zero: 


g B- (P “ 1) 

A s c - h + T + CD ; -“l)q • 

This is the same as 

p s >|[l + (D —l)q] >p s+1 . (44.20) 

We then obtain 


log | [1 + (D - 1) q] 
iog P 


(44.21) 


Determination of D: 

D is the smallest whole number for which the first difference of c in D is greater than 
or equal to zero. After some computational steps one obtains 


h[^ (D — 1) + q(D — 1) + 1] > qpk + gp 


s+1 


s +1 

If one substitutes p by the approximation (44.20), then one obtains 


D(D-l) = 


2pk 


qh 


Noting that p/q is precisely the expected value n for demand, we then have 



(44.22) 
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This result is similar to the Wilson Formula D = 



for the case with deterministic 


demand rate /i. 


The above results are based on the fact that the state probabilities t for y < S are 

identical (see (44.17)). This is correct only for the geometric and the binomial 
distributions. If n is small, then the state probabilities t are also almost identical for 


other distributions distributions and the formulas (44.21) and (44.22) serve as good 
estimates. 


§45 AHM-MODEL WITH DELIVERY TIME 

We previously assumed that the stock at the beginning of a period can be replenished 
immediately. This is an ideal which can only be justified for long periods. As a rule, 
one must reckon with delivery times even if the goods come from the same firm. We 
describe delivery as encompassing a number of individual activities from dispatching, 
to loading, transporting, unloading, receiving inspection up to storage, all of which 
require a certain amount of time. 

We therefore generalize the AHM Model for the case without reliable delivery times. 
It will be shown that the expected model does not go beyond the framework of the 
AHM—Type. It is, however, only possible (with only a single exception) at the cost of 
increasing the state space by at least one dimension. Let 

t : Delivery time 

Basic for all models with delivery time is the consideration that costs are related to the 
point in time when the ordered quantity is eventually delivered. One does not have 
any influence on the status of inventory before this time because a present action only 
has an effect after r periods. 
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Stock 

h 


t 

Order z 


Stock 
y t +z -h(t) 

Time 


t + r 
Delivery 


Figure 45.1 

The quantity in the above figure y t + r = y t + z - u(r) is correct if one adds the 

on-order quantities to the physical stock. This is reasonable, since these quantities are 
available at time t + r, while they are still on-order at time r. Hence, we describe the 
models with delivery time 

y : Stock on-hand plus on-order 

z: Order quantity 


Case 1: Delivery Time r = 1 

The order quantity is delivered at the start of the next period. 


BACKORDER Case: 



(y) = Min (k<5(z) + f(y) + p 
z >0 


00 As. 

f v n _i(y + z-u)dP(u)}. 
0 


Since z = x — y, it then becomes 



(y) = Min {k<5(x — y) + f(y) + p 
x >y 


J v n -t( x “ u )dP(u)} • 
0 


(45.1) 
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LOST-SALES Case: 


v (y) = Min {ktf(x-y) + f(y) +/> J v j(x-u)dP(u) 
x>y 0 n 1 

+ /» v n _l( x - y )[l- p (y)]} • (45.2) 

It is to be noted here that shortages may occur even before an order arrives. 
Therefore, shortage is given by y - u <0 and not by x - u <0. Since x > y, shortages 
occur with a greater probability as compared to the model with delivery time r = 0 

l-P(y) >l-P(x) . 

This implies that the stock fluctuation becomes larger for models with delivery time as 
compared to models without delivery time. This also applies if the delivery time is 
fixed, i.e., reliable. The delivery time makes inventory more expensive (i.e., if g > h). 


Case 2: r = 2, 

If the delivery time lasts for two periods, one has to take note of the quantity of the 
previous period. Let 

y : Stock on-hand plus the remaining order quantity two periods before 

Zj : Order quantity from the previous period 

z : Actual order quantity, z = x - y. 

The decision about the present order quantity depends on the value of both states y 
and Zj. The principle of optimality in the BACKORDER case is 


~ . 00 a 

v (y,z 1 ) = Min {k<5(z) + f(y) + p J v. (y + z. -u,z)dP(u)} , 

Z 0 


(45.3) 
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in the LOST—SALES case 


v (y.Zj) = Min (k<5(z) + f(y) + p ) v n _i(y + z i ~ u > z ) dP ( u ) 

Z 0 

+ P v n _ t ( z p z )[l -P(y)]} • (45.4) 

Case 3: r = m 

In general, let the delivery time r be m periods long, m e N. 

Let 

y : Stock on-4iand plus the remaining order quantity of the previous m th period 

z. : Order quantity of the previous i period 

A single variable representing the total remaining orders of the previous periods is not 
enough to describe the state of the system. To be able to explain the development of 
the stock, y t —> y t + ^ eac ^ single order, z^, must first be noted and will be used only 

upon the delivery of the latest order z m . We therefore have a vector of m states 

(y,z 1 ,z 2 ,...,z |n _ 1 ) . 

These are written as a recursion according to the formula 

y -* y + z m _j — u (Inventory Balance Equation) 

(z 1 ,z 2 ,...,z |n _ 1 ) -> . 
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The Principle of Optimality is 
in the BACKORDER— Case: 

v n(y ,z l ’'' ‘ ,z m—l) = Min { k( H z ) + 

z 

* 00 a 

+ f(y) + pj v n _ 1 (y + z m _ 1 -u,z,z 1 ,...,z m _ 2 )dP(u)} , (45.5) 

0 

in the LOST SALES Case: 

A 

v n (y,z 1 ,---,z in _ 1 ) = Min (k<5(z) + 

z 

~ y - 

+ f (y) + P f v n _i(y + Zm—i — u,z,z 1 ,... ,z m _ 2 )d p (ii) 

A 

+ / ,v n _l( z m _l> z > z p--- ) z m _2)[ 1 - p (y)]} • (45.6) 


Case 4: Delivery time r is not discrete. 

The delivery time r need not be an integer. Nothing is changed in the 
BACKORDER-Case for 0 < r < 1 and m — 1 < r < m. The formulas (45.1), (45.3) 
and (45.5) are still applicable. We just round off the delivery time. There is, however, 
a difference in the LOST—SALES Case. 

Since the last pending delivery has already arrived before the end of the present period, 
it can still be added to the saleable stock. Then, instead of (45.6), it must now be 

v n (y,z 1 ,...,z m _ 1 ) = Min {k<5(z) +f(y) 

z 


y +z ro-l 

+ p f 

0 


n-l ( y 


+ z i — 
m—1 


u,z 1? 


m—2 


)dP(u) 


A 

+ P v n _i( 0 » z > z p 




ft ft ft 


)[1-P(y+ z 


(45.7) 
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The number of possible states for r = 2 is already large. Hence, an approximation is 
also needed here. 


A pproximation 

We revise the period length so that it is identical to the delivery time. We then have a 
model with a delivery time of one (long) period. This rescaling of time changes the 
distribution of demand. For a delivery time of r = m periods, the demand u^ + u 2 + 

... 4* u m occurs as total demand over m periods. Since demand in the individual 

periods is assumed to be stochastic as well as independent, the distribution function of 
total demand is the m-fold convolution of the distribution function P(u). 

P< m >(u) : Distribution function of demand within the period r 

(for delivery time m). 

The holding and shortage costs are then defined as 

f( m )(y) = (h + g) jf p( m )(u)du + g(m/i-x) 

o 

and the value function satisfies the recursion in the BACKORDER-Case: 

v n (y) = Min (k<5(x-y) + f^ m ^(y) +p f v . (x - u)dp( m ) (u)} , (45.8) 

x>y o 

and the LOST-SALES Case: 

v (y) = Min (W(x-y) +f^(y) + p] v n _ t (x - u)dp( m ) (u) 
x>y 0 

+ p v-i( x - y) t 1 _p ^( y )]} • 


(45.9) 



206 


CHAPTER 5: STOCHASTIC MODELS WITH PERIODIC REVIEW 


For this approximation model, the following is to be noted: By lengthening the period, 
S, as well as D, will become larger. Thus the quantity-dependent costs increase. The 
fixed ordering cost is not affected but remains constant. Therefore, the lengthening of 
the period has the effect of relatively reducing the fixed ordering costs and should be 
checked in individual cases whether one is better off without the fixed ordering cost in 
the model. In case m is large and k is small in the original model, a model without 
fixed ordering cost is suggested as an approximation. In this case, a S(m) policy is 
optimal with 


S 



= P W 1 (_-£_) . 

h + g 


(45.10) 


As much as possible, one should avoid lengthening the period because it increases the 
variance of demand, which again drives the inventory costs higher (compare §38). 


Stochastic Delivery Time r 

We consider here the simplest case: only one order remains at the most. Let 

: Probability that the delivery time is equal to r 

c Kr): Distribution function of delivery time 

•A. 

v n (y,r,Zj): Value function with a planning horizon of n periods, stock y and 

remaining order quantities zl since r periods. 

r is now an additional state value. 


For the functional equation of the value function, one needs the transition probabilities 
<p + i: Transition probability from the state "still remaining after r periods" to 

the state "delivered in period r+1". 


INVENTORY CONTROL: MODELS AND METHODS 


207 


Then 


^r+l = U(t) ’ 

Vl = Qi . 


r > 0; 

(45.11) 

r = 0 . 

(45.12) 


As long as an order is still in transit no new order may be placed. The next order is 
allowed only when z^=0. Hence, the Principle of Optimality is now contained in both 

equations, for the BACKORDER Case, 

/\ a 00 

v n (y> z i> r ) = f(y) + p<p t+1 J v n _i(y + z< -u,o,o)dP(u) 

0 

oo - 

+ />[1 — ^ r+1 ] / v n _iCy — u,z-,r +l)dP(u) (45.13) 

0 


oo 


v n (y,0,0) = Min {k<5(x-y) + f(y) + p<p. j v (x - u,0,0)dP(u) 

x>y 1 0 


00 - 


+ 1 v n _ 1 (y-u,x-y,l)dP(u)} 

0 


(45.14) 


and, for the LOST SALES Case, 


v n (y> z i> r ) = f (y) + pv t+ i J v n _!(y + z i -u,o,o)dP(u) 

0 


+ w r+1 [i - p (y)] v n _i( z i>°>°) 


y - 

+ p[ l ~9 T+ 1] J v n _ 1 (y-u,z p r+ l)dP(u) 

0 


+ p[! -K> t+1 ] [1 -P(y)]v n _ 1 (0,z 1 ,T+ 1) , 


(45.15) 
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v (y,0,0) = Min {k<5(x — y) + f(y) 

x>y 

+ PV 1 J v n _ 1 (x-u,0,0)dP(u) 
0 


+ ptp x [\ -P(y)] v n _ 1 (x — y ,0,0) 


+ p[l-^] j v n _ 1 (y-u,x-y,l)dP(u) 

0 

+ /»[ 1 -^ 1 ][ 1 - p (y)] v n _ 1 (0,x — y, 1) } (45.16) 


§46 AUTOCORRELATED DEMAND 

The Planning Research Corporation of the US Navy has already accepted the AHM 
Model with constant expected value of demand. A slide rule whereby the (s, S) policy 
can be determined was developed by M.J. BECKMANN . The parameters are /i, g/h 
and k. (See §38: Standardization). Models whose solutions required a stochastic 
dynamic program were at that time computationally cumbersome. However, the 
assumption of a stationary demand distribution is very unrealistic for some 
applications. With the increasing performance of the computer, it will be easier to use 
more realistic but also more computationally intensive inventory models. 

In this section we handle the case where the demand u is influenced by its past history. 
We define a conditional density function as 

p(u)du = p(u|upu 2 ,. •. ? Uj c )du 
Uj: Demand in the previous ith period 

We specially consider the MARKOV Case 


p(u)du = p(u|uj)du . 


(46.1) 
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The last observation u, is taken to be the state variable in the optimality equations. 
These are 

v (y,u.) = Min {k<5(x — y) + a(x-y) + ffc.Uj) + 
x>y 


00 

+ P \ v n _ 1 (x-u,u)dP(u|u 1 )} , n = 1,2,... , (46.2) 

0 

whereby 

X oo 

f(x,Uj) =h / (x — u)dP(u|u^) + g / (u —x)dP(u|uj) . (46.3) 

—oo X 

What does one obtain by including the last observation? To answer this question, let 
us look closer at the MARKOV Process. Let 

1) p(ti|uj)du = '£(u-/i(u 1 )) , 

i.e., the expected value n is dependent on the last observation 

2) fi linear: H - H 0 ) 

fi Q long-term average 

One of the more frequently used demand processes is the resulting special case of the 
autocorrelated process of the first order, the so-called AR(1)—Process. The process 
equation is 


u 


t 




-** 0 ) + e t ’ 


(46.4) 


where \a\ < 1, for all t independent and identically (0, cr^)-normally distributed 
with distribution function tf(e). Then 
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k-1 

u t = ° k ^t-k-^ + I a \-i + v 

i=o 

In the steady state, i.e., for t->oo 



l “Vi^o 

i=o 




E <\! - E{ tt } = „ 0 , 


(46.5) 


00 


= £ {[ l al h-i + ~ ^ 


1=0 


1 2 
7T ( 7 

2 e 


1 - a 


3 > <?“ for | a| < 1 


(46.6) 


Note that for | a| <1, the unconditional process {u { } is also stationary with E{u { } = 

H Q and E{[ut — H 0 f} = a \- But by including the last observation u, in memory, i.e., 

according to the formulation of the AR(1) Process, the variance of demand becomes 
smaller in the present period as was shown in (46.6). 
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§47 INVENTORY WITH FORECASTING 

The introduction of the AHM inventory model can sometimes fail in models which 
imply a stationary demand process. However, information about the future course of 
demand usually exists from which short-term forecasts can be made. These forecasts 
must be considered in the model. One finds such situations, for example, if supply 
agreements are settled with a major client. 

In practice, one proceeds in such a way that a demand forecast is made in the first 
step, a safety stock level is determined in the second step, the forecast as deterministic 
demand is determined in the third step and, using a deterministic model, the optimal 
ordering rule is computed. This stepwise procedure, however, leads to solutions which 
are, as a rule, sub-optimal. 

Optimal solutions are obtained if the forecasts are integrated in the dynamic 
programming method. This requires a reformulation of the optimality principle. 

We differentiate two types of forecasts: the exogeneous and the endogeneous forecasts. 
In the endogeneous forecast, the forecast values are solely derived from the previously 
observed demand. The autoregressive scheme is a special case. We summarize the 
preceding observations u,, u 2> ••• ’ n a sufficient statistic and compute using the 

conditional density 

p(u|wj) 


instead of the conditional density 

p(u|u 1 ,u 2 ,...) . 

The information extracted from the past values ul, u2, ... are considered as a forecast 
for the demand u occurring in the present period. If the exact value of u is known at 
the end of the period, a new forecast can be computed with the help of the forecast 
formula 
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w = g(u,v 1 ) (47.1) 

It is important that the new forecast w can be derived recursively from the old forecast 
Wj and the latest observation of demand u. It is then possible to integrate the 

forecasting process in the optimality principle. It is given as 
v (y,w.) =Min { k <5(x-y) +a(x-y) +f(x,v 1 ) + 

X 

00 

+ p 1 v n _i( x - u > w ) dp ( u l w i)} • (47.2) 

0 


Example: First Order Exponential Smoothing 

The sufficient statistic w is a weighted average of all observations u^, ... whereby 

the observations preceding k periods are weighted by the factor a , 0 < a < 1: 

p(u|u 1 ,u 2 ,...) 

00 

”l = (!-«) \ u k+l ftk ( 47 - 3 ) 

k=o 

The forecast formula is (compare §6) 

w = aw. + (1 - a)u . (47.4) 

If one formulates it depending on t, i.e., 

w t + i = ftw t + ’ 

then one observes that in the stationary state 


w = u 
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and, hence, w can be reasonably interpreted as a forecast of a non-periodic stationary 
process. 

The forecast is unbiased for the stationary demand process {uT: 


E{w} = (1 -a) 




k=o 


= /* = E{u} 


and has a variance 


Var{w} = (1 — a)^ ^ ^\ 

k=o 





which is smaller than & 


u 


Conditional Expected Value as Forecast 

Indeed, the above forecast is unbiased but does not have a minimum variance. 
Forecasts with minimal variance are obtained if one chooses the conditional expected 
value as the forecast formula which is the case in an autoregressive scheme. 


Exogeneous Forecasts 

In the exogeneous forecast, the source of information lies outside of the model. This 
situation exists, for example, if the forecast and stock control functions are done in 
different departments of a firm. The forecasting department makes its forecast data 
based on business and economic data. For the inventory manager, the forecast takes 
on the character of a random variable: 
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Wj The latest forecast. It is based on the demand of the previous period, 
u has the density p(u|wl) du. 

w is the forecast still to be made for the future period, w is, in the eyes of the 
inventory manager, a random variable, which is hidden from him by the 
forecast mechanism. 

<p(w)dw is the density of w. 

The dynamic program is then 

V (y,w.) - Min {k<5(x — y) + a(x - y) + f(x, v^) 
x>y 

+ P \\ v n _ 1 (x-u,w)p(u|w 1 )^(w)du dv}. (47.5) 

The advantage of this model as compared to the model without a forecast is that w^ 

now allows the distribution of demand to concentrate more on the short-term expected 
value (in case the forecast is good!). This advantage, however, can again be partially 
lost since w introduces new uncertainties into the model. This is expressed in the 
double integral in (47.5). The double integration smooths the cost difference between 
the favorable and the unfavorable states. The cost curve v becomes flatter. 


Reduction of the State Space 
Under the two conditions 

Cl) p(u|w^) = (f{u - Wj) = (/p(e) with constant variance 
C2) w independent of u and w^ 

the dynamic program can be formulated with only one state variable. 
We rewrite the Principle of Optimality in which we use the variable 

6 : — u — W| (= Forecast error) 
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i.e., 

vJyjVj) = Min {kfftx-Wj-(y-wj)] + a[x - w j - (y - «j)] + 
x>y 

X-W 1 oo 

+ h / i (x-w 1 -c)j?(c)dc + g / [c - (x-w 1 )]p(e)de 

—oo X—W | 


+ P jj Vj^fx- Wj - e,w]p(e)tf(w)de dw} . 


(47.6) 


We consider the new state values 


r := y — w 1 ; . (47.7) 

£ := x — Wj ; (47.8) 


where r and ( are net inventories, i.e., inventories which exclude the estimate w^ of 
demand u. 


r : Net beginning inventory before the order 

( : Net beginning inventory after the order, i.e., £ = r + z 


With these new state values, (47.6) becomes 






r 


Min {k<$(( — r) + a(£ — r) 

£>r 


+ h j (f-e)p(e)de+g J (c — ^)v?(e)d(e) 

-t» £ 


+ P Jf v n _iU - e,w]>?(e)4'(w)de dw) . 


(47.9) 
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The right-hand side no longer depends on w^. Therefore, one can give up the second 
state variable w, and the Principle of Optimality can be formulated into a single state 
variable as " net inventory". 


(47.10) 

How realistic are these two conditions Cl and C2? 

First, the normal distribution satisfies Cl, since 

(u-w ^ 2 

n(u|w 1 ) =—-—e ^ ° =n(u-w 1 ) . 

1 (2 to 1 

2 

A constant variance a is also required in Cl. As a rule, this is accepted for goods 
with a very low market growth. For a higher market growth , would also increase. 

Second, the forecasts w and w^ should not be autocorrelated. 

Third, the demand u should not influence the forecast. Therefore, goods whose output 
are representative of a key industry are excluded. 

For example, if w is the change in the gross national product, then the above method is 
applicable for goods which follow the acceleration principle, e.g., investment goods and 
spare parts. 
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Exogenous Forecast with Self-adaptation 

For certain goods which do not fulfill the above conditions, the forecast w will depend 
on the old forecast w^ and on the immediate demand u. w then has a conditional 

density 


®(w|Wj,u)dw . 


The Principle of Optimality can now be written in the form 


v_(y,w,) = Min {k<5(x — y) + a(x-y) + 
x>y 


X 00 

+ h f (x-u)p(u|w 1 ) +g j (u-x)p(u|w 1 ) + (47.11) 

—oo X 


+ P ff v n _[ (x — u,w)p(u,w^)'f(w|wpu)du dw} . 


In this manner, an adaptive mechanism is integrated in the dynamic program. 



CHAPTER 6: 
NUMERICAL METHODS 


In the preceding chapters, a number of basic inventory models - especially the 
stochastic models — were introduced. Most of these models, however, must be 
modified to fit the requirements in actual practice. Because of this, the models may 
change in such a way that the suggested solution is no longer suitable; for example, 
when considering multiple discount breaks and transportation costs. This chapter, 
therefore, introduces a number of numerical methods which help one to calculate very 
general models. The method due to FEDERGRUEN and ZIPKIN (1984) discussed in 
the last section is an exception. 


§48 VALUE ITERATION 

All inventory models, which can be formulated using Bellman’s Principle of 
Optimality, can be solved by the value iteration method. It involves the recursive 
evaluation of the functional equation of dynamic programming. 

General Method of Value Iteration 

Step 1: Start with v Q = 0 or another vector appropriate to the problem, n = 1 

and a termination criterion. 

Step 2: Compute v using the principle of optimality. 

Step 3: Termination criterion satisfied? 

No : Set n : = n + 1 and go to 2. 

Yes: Go to 4. 

Step 4: Stop. 
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A termination criterion can be set, 

a) in the case of a finite planning horizon, upon reaching the end of the horizon 

b) in the case of an infinite planning horizon: 

— a maximum number of iterations is reached 

— when an absolute increase falls below a given minimum 

I v n + l ~~ v n I < e abs 

~ when a relative increase falls below a given minimum 
I v n + l" v nl/l v n + ll < e rel 

— when the difference between successive increases falls below a given minimum 
(specially in an undiscounted case) 

I I v n + l~ v n * “I v n“ v n—ll I <r ' 


Infinite planning horizon 

The following considerations assume an infinite horizon. 

Inventory models with identical demand distributions in each period can be formulated 
as a homogeneous Markovian decision process. For numerical purposes, we assume the 
discrete case. 
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Let 


1 : 

d 

6 : 


State, i = 1,2,...N 

Decision, d- is the decision in state i 

Ordering rule 6 = (d^, ..., d n ); (6 is no longer the Kronecker symbol!) 
Transition probability from i to j with decision i 


Expected value of the single period cost, depending on state i with 
decision d- 


We set costs as negative values and obtain a maximization problem 


v (i) 

n v ' 


max 

d 


f N 
1 


(48.1) 


n = 1, 2, 3, v Q given, p < 1. 


or in vector notation 


v n = ".ax {a^ + ^v n _ t } 


(48.2) 


with 
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In a (s,S)-ordering rule, has the following form 




S s 


The bars 1 —. describe the distribution (p(l), p(2),...p(u max )) °f demand u. 

All other elements of the matrix are zero. 

For simplification we introduce the following notations 

a) in case the decision in each state is fixed and not maximized then the function 
w replaces the value function v, 

w n( i ) = a i + ^PijV-l(j) ’ 

J 

and we shorten this equation as follows 


w n (i) = U^i.'V.p 
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or in vector notation 


w n = L ( 5>w n- 1 ) ; 


by maximization, instead of 
V (i) = nupc {a'J +/> J pf i v n _ 1 (j)} 

n d 


we now write 


v n (i) = max l(d,i,v j) ; 
d 


or as a vector 


v n = maxL(^v n _ 1 ) ; 


and using 


U : = max L 

6 


the short form 


v = Uv„ 1 
n n—1 


is obtained. 


With this notation, the general method for value iteration in the discounted case, i.e., 
for j < 1 with an infinite planning horizon, is transformed into the following algorithm: 
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Value iteration in the discounted case, infinite planning horizon 


Step 1: 

Start with v q (e 0),c abs > 0. 

Step 2: 

Compute Uv (maximizing step). 

Step 3: 

|| Uv - V || > € abs ? 


Yes: Set v : = Uv and go to 


No: Go to 4. 

Step 4: 

Stop. 


The supremum norm || v || = max (v(i)} is used. 

1 


The above value iteration is sufficient for convergence if the largest absolute eigenvalue 
of the iteration matrix pP is less than 1. 


Lemma 48.1: The matrix p P has a highest possible absolute eigenvalue | A | max = p. 


Proof: 

T 

The matrix P has an eigenvector, (1,...,1) =• e , which corresponds to the eigenvalue 
1. Since an upper bound of A is given by 


\| <max ^ 


iJ 


J 


= 1 (Eigenvalue estimate) 


A = 1 is the greatest eigenvalue of P. Hence p is the greatest eigenvalue of pP. q.e.d. 

Thus the convergence of value iteration in a discounted optimization problem is, 
therefore, assured. 


Aside from this, 

N 

Lemma 48.2: For p < 1, L is contracting, i.e., for all u,v e IR and for all 6, 


| L(<5,u) — L(<5,v) | < p\ u — v | , 0 < p < 1 . 
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Proof: | L(<5,u) -L(M) I = I & S + p? ~ & 6~ p? 6 W I 

= I /^(u-v) II < P II II II U-V II 
- p\ u — v | . q.e.d. 

Next, it will be shown that the contraction property also holds for U. 

N 

Lemma 4&3: For p < 1, U is contracting, i.e., for all u,v t IR 

| Uu — Uv | < p | u — v |, 0 < p < 1. 

Proof: 

N 

Let i be any component of vector v (i.e., we take any state). Then for u,v c IR : (Uu)- 
= (Uv)j + k. Let k > 0 and let 5 be the maximizing decision with respect to u in i, 
i.e., 

(Utt) i = l(a,i,u) > (Uv) i = (Uu) i — k. 

By definition of U, (Uv)j > 1(3,i,v). Hence, 

(Uu)i = 1(3,i,u) > (Uv) • > 1(3,i,v) , 

whereby it follows that 

(Uu). - (Uv). < 1(3,i,u) - 1(3,i,v) 

<p\ U-V I , 

since L is contracting. This is valid for all i, hence 


| Uu — Uv | < p\ u — v | . q.e.d. 


INVENTORY CONTROL: MODELS AND METHODS 


225 


With this, the conditions of the BANACH Fixed Point Theorem are satisfied. It 
follows that a fixed point exists for each decision rule <5. This satisfies the fixed 

point equation 


v s = l(6,u s ) . (48.3) 

* 

In the same manner, there exists exactly one fixed point v as the solution to the fixed 
point equation 

v* = Uv* . (48.4) 

* 

Value iteration is one of the many possibilities for determining v . The use of a 
particular method depends on the extent of the computation. 

Let us examine the convergence behavior of value iteration. If one has already 
performed n iterations, the norm of the residual is reduced after R additional iterations 
by a factor of p since 

I v n+R“ v *l -^ R ! v n -v I ' ( 48 - 5 ) 


How many iterations R are needed to improve the current precision of this 
approximation v Q by a decimal point? That is, 

* 

I Vr - v I i 



From (48.5) one obtains 

R 1 
p = TU 

I R _ -1 
R ' 
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R is a constant. In that case we say that the method is linearly convergent. Looking 
at it from a numerical standpoint, such methods are tedious. This is true especially if 
the discount factor p lies close to 1. Consider, for example, the following values for a 
yearly interest of 10% 


Period 

p 

R 

Year 

0.91 

24 

Month 

0.99 

277 

Week 

0.998 

1198 


Acceleration of convergence in the discounted case with infinite horizon 
A number of methods can be used to accelerate convergence. 


1) Single-step iteration: 

Assume that we are computing v n + i(i)- For k < i, we can use the "better" 
value v n + ^(k) for the required summation instead of v n (k). This leads to the 
so-called single-step iteration. The recursion equation is 



max • 
d 



V. 


i-1 N , 

+ EpfjVlUW I Pij v n^j> • 

*4 • • 

J=1 J =1 


2) Further improvements can be brought about by the following variation: 

By maximizing with respect to d, the right-hand side of the equation is 
evaluated for different values of d. If an evaluation leads to an improvement, 
one uses the improvement w n + ^^) f° r next i terat i° n instead of v (i). 
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max 

d 


- 

4 + p l p ijVl (j) + PPii “ax{v n (i) l w n+ i(i)> 

j=l 

N 

+ P l p ij v n(j) * 
j=i+l 


3) Divided Form: 

Derivation: We iterate only in the ith—component. 


w n,l (i) : = 


w n,2( i ) : 


4 + p l p ij w n (j) +/,p ii W n (i) 
j+i 

a i + p I p ij w n (j) +pp ii w n,l (i) 

j+i 


k-1 


w n,k (i ) := 


l4 + p I p ij w n^ I (Wii) r+ (W? i )\( i ) 


d sk 


j+i 


r=o 


Since /?p. • < 1 


; im Vk (i) = r~ r t4 + p l 4 • 

k->oo 5 1 — pp. . • i . J 

j+1 


This leads to the iteration procedure: 


w n : = C 1 -^ 1[a * + ' (P a, L + P $,U )w ii-d 


(48.6) 


whereby I is the unit matrix and the matrix is broken down into a lower 
triangular matrix P^, an u PP er triangular matrix P^u and a diagonal matrix 

P <5,D’ i - e > 
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Iteration (48.6) converges also to a fixed point w^ which can be shown easily by 
substitution. Hence, value iteration converges also in divided form 


v 

n 



max 

d 


1 


i - ^ 




j+i 


(48.7) 


* 

to a solution v of the optimization problem. Value iteration in divided form 
can also be performed in the above variations (1) and (2) which lead to an 
additional acceleration of convergence. 


Value iteration in the undiscounted case, infinite planning horizon 

With a fixed policy 6, the iteration w n = L(<$,w n _j) generates a non-convergent 

sequence starting with w Q e 0, 

+ P ^a ^ + ... + P ^ . 

The change An = w fl — w n _^ tends, however, towards a constant vector. It is 

V P S la <5 (48 - 8 ) 

and with stochastic matrices, the limit II ^ exists for each S 


lim P*j = II^ 

n-hM 


? 


such that the increase A„ also has a limit 

n 


lim A = lim P* ac . 
n do 

n-^oo 


n-Hx 


(48.9) 
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A is the stationary periodic return which is also called the average return. The 

numerical problem is that of determining the decision rule S which gives the highest 

* * 

average return A (A > A^ for all 6, element by element) 

* 

A = max A c . 

6 6 

This is done by the 

Value iteration in the undiscounted case 
Step 1: Start with v = 0, r > 0. 

Step 2: Compute v := Uv ; compute A , , : = v - v q . 

Step 3: Compute Uv; compute A : = Uv — v. 


Step 4: 

Is 1 II A old II 

-IIAeJH >r ’ 


Yes: 

^ et ^old ' - ^new 



and go to 3. 


No: 

Go to 5. 

Step 5: 

Stop. 



Basically, in the value iteration with infinite planning horizon, one obtains only an 

* 

approximation of an optimal value v . Hence, one can also never be sure that the 

* 

optimal policy 6 has been determined. It is possible that one can still achieve a better 
policy continuing the iteration procedure. 

However, there are methods which can recognize and select sub-optimal decision rules 
partly at the start and partly during the iteration (compare McQUEEN (1967), 
BARTMANN (1976)). If a single policy still remains then one can be sure that this is 
also optimal. 
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§49 POLICY ITERATION 

Let us return to the undiscounted case p < 1 with an infinite horizon. The method of 
policy iteration proceeds according to the following manner: 


Choose a decision rule 
compute w^ ; 

search for a decision rule by which improves w^ ; 
compute w^ ; 

search for a decision rule by which improves on w^ ; etc. 


In this way a sequence of fixed points w^ is computed which is monotonically 

increasing. Since only a finite number of decision rules exist in the discrete case, this 

chain stops at a maximum fixed point w r after a finite number of steps. 

m 


Wc < Wr < 

<§1 < 5 2 


< 


w <5_ 


m 


max w r . 

6 6 


While value iteration is advantageous if a starting vector can be given as a good 

* 

approximation for v (the BANACH Fixed Point Theorem shows that one can begin 

N 

with only a starting vector v Q c IR ), policy iteration is recommended if one determines 

* 

a good approximation for S as a starting decision rule. 

Policy iteration in the discounted case 
Step 1. Start with b. 

Step 2. Compute w^ as a solution to the system of equations 

= L(«5,w^). 
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Step 3. Test for optimality of 8: 

a: Compute Uw,. Let the maximizing decision rule be 6. 

b: Is H S? 

9 

Yes: Let 8 : = 8 and go to 2; 

* * 

No : Let 8 : = 8; v : = w r and go to 4. 

Step 4. Stop. 

* 

The termination criterion gives the optimal decision rule 8 (with value iteration, this 

* 

is not guaranteed!). It still remains to be proven, however, that the optimal value v 
of the dynamic program can be actually determined in this manner. 

Lemma 49.1: Among the optimal strategies of a Markovian decision problem of 

the above type with a finite set of decisions is a stationary 

* 

decision rule 8 . 

Proof: 

With an infinite planning horizon, an optimal strategy of the above problem is an 
infinite sequence ... ^ n <5 n+ ^ ••• of decision rules (exactly one for each). Either a 

stationary strategy ... 8 8 ... is already optimal or there exists a stationary 

A 

improvement <5 because of the monotonicity of L 
... 8 S 6 6 ... 

Since the set of decision rules is finite, there exists a stationary strategy for which 
there is no stationary improvement, q.e.d. 


Undiscounted case. 

The decision rule is a bit difficult in this case. We limit ourselves therefore to the 
so-called complete ergodic case. It implies that the state probabilities 



Probability that the system is in state i after period n by using policy 6. 
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are independent of the initial state for n -> oo. 

Let ?r n ^ = ( 7 r n (l), 7r n (2),...,^ n (N))^ be the distribution of the state probabilities of the 
system after n periods beginning with the initial distribution 7r Q g Then 


- T P U 

n,<5 T o 


(49.1) 


In the complete ergodic case, the limiting value is 


t c - 1 im j c 
o „ n, o 
n-*oo ’ 


= lim r .P 1 ] 
0.0 0 
n-Kx) 5 


(49.2) 


independent of 7r Q g Since 1 im P^ = II g then for (49.2) 

^ _V 


n-^oo 


vAr-V 


(49.3) 


Since this relationship must also be valid for the improper initial distributions 
(1,0,...,0), (0,1,...,0,0), ... (0,0,...,0,1), it follows that the matrix has identical rows. 

Then the average return A becomes a vector with identical components (48.9) 


a^ — a^c j 


(49.4) 


e A = (!,..., 1). The average return is then independent of the initial state. It is a scalar 
of size a g 
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a^: Average return (stationary periodic return) of a fixed undiscounted ergodic 

state process; 

The total return v n (i) asymptotically satisfies the relationship 

v n (i) = na + V(i), n very large . (49.5) 

Let 

tfi 

V^: Difference between the expected total return of the n periods and the n in 

average return na 


V: Lim V 

n->oo 


n 


The expected return in a single period starting in state i is a^ and for any period n 


a 


• + V p. -a- + ... + Y ^a- . 

1 L *ij j L v ij j 

• • 

j j 


We write the difference in the form 


v n (i) = a i + \ p^aj + • • • + Jp-j ^a, - (n- l)a-a 


J 


J 


= a i + 1 PikK + 1 Pkj a j + • • • T pft 2)a j) - ( n 

k j 

= a i + I p ik v n-l^ k ^ _ (n — l)a — a 
k 

= a i + I p ik[ v n-l( k ^ " (« - !)a]-a 

k ^ y 


— l)a — a 
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whereby the following recursion is finally derived 

v n (i) + i = a i + Jp ij v n _ 1 (j) . (49.6) 

J 

For n -• oo, this becomes 


V(i) + a = a i + J P i jV(j) . 
_J_ 


(49.7) 


Hence, the resulting value function V measures the total deviation of the total returns 
from the accumulated average returns. The values V(i) in (49.7) are determined for all 
but one common factor. For normalization, we set any component to zero, e.g., 
V(N) =0. The other values V(j), j t i and a are then computed by solving the system 
of equations (49.7). 

How is the best decision rule determined? 

It must maximize the stationary single period increase. 


For a planning horizon n, it is required to 



It can also be written as 


r 


max • 
d 




(49.8) 
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The maximizing d remains the same even if one subtracts the term na (it is 
independent of d). The test size is then 


max 

d 



+ 


y P d .v 1 

ij n—1 





and for an infinite planning horizon 


r 


max • 
d 




(49.9) 


Policy iteration with o = L complete ergodic case 
Step 1: Start with decision rule S. 

Step 2: Set V(N) = 0; 

compute V und a as solution to the system of equations 
V + ae = a r + P rV. 

Step 3: Test for optimality of 6: 

a:Compute max (a^ + P^V}. Let the maximizing decision rule be S'. 

6 

b: Is <5+ S' ? 

Yes: Set S : = S' and go to 2; 

* — * — 

No: Set S : = S; a : = a and go to 4. 

Step 4: Stop. 
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550 BISECTION METHOD AND DYNAMIC PROGRAMMING 

The policy iteration has the disadvantage of poorly estimating a priori the extent of 
the computation. The disadvantage of value iteration lies in its very slow convergence 
if the discount factor is close to 1. Let us, therefore, look at a third method: the 
Bisection Method in connection with dynamic programming (BARTMANN (1979)). 

The bisection method is applied to determine a zero element of a real-valued function. 
The interval containing the zero element is continually shortened by halving until its 
length falls into a specified stopping limit. To shorten an interval into one-tenth of its 
size, aproximately 3.32 bisections are needed. If the bisection method can be applied 
successfully to a Markovian decision process with an infinite planning horizon and 
discount factor p < 1, the computational effort will be independent of p and shorter 
than in value iteration as long a s p > 0.5. 

The problem of computing the fixed point v is now represented as a problem in R . 
An interval 


— ~ 4 - 

V < V < V 

N 

is easy to determine; however, the bisection method does not hold since R is only 
half-ordered. This means that after the bisection step 

+ — 
v + v 

V B 2 

* _ 

not only v t [v ;vg] (situation 1) 

* + 

or (exclusive or) v t [vg;v ] is possible (situation 2) but situation 3 can also occur. 
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Situation 1: v lies in Situation 2: v lies in 

the left sub-interval the right sub-interval 


V" V* Vg v + 



Situation 3: v lies neither wholly in the left 
nor wholly in the right sub-interval 

Explanation of the above diagram: 

Each horizontal line means a real axis. Each component v(i) of v is placed on a 
specific axis. The combination of these individual values results in the jagged line as a 
representation of the vector v. 

To be able to apply the bisection method it must be modified. The whole method 
consists of 5 parts. 

* 

Part 1: Computation of a suitable starting interval which contains v 

Part 2: Bisection step. 

Part 3: Test which of the three situations arise. 
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Part 4: If Situation 3 occurs: Perform some maximization steps : = Uv ^ 

until monotony is achieved, i.e., 

v > v 1 or v < v v 
n ~ n—1 n n—1 

Then v fl becomes the interval partitioning vector, since it follows that 
either 

* * + 

a ) v „ > v „ i => v > v „ => v e [v_;v ] or 

7 n n—1 n L n J 

* * _ 

b) < v„ ! 3 V < v„ => v € [v ;v 1 . 

' n n—1 n 1 n J 

Part 5: Termination criterion. 

These parts, when combined, can be written as an algorithm. We formulate it for the 
standard situation "Maximization problem, all a- - < 0 

J 


Bisection Method and Dynamic Programming 
Step 1: Start with v Q = 0, e abs , 6 fel > 0. 

Step 2 : Compute v : = Uv Q . Let the maximizing decision rule be S. 

+ * 

Step 3 : Set v : = v (upper limit of v , since Uv < v). 

Step 4 : Compute w^ for 6 from Step 2 by solving the system of equations 

w s = L(<5,wp. 

Step 5 : Termination criterion: 

a) Check, if 6 is already optimal: 

A 

compute v : = Uw,; 

if 6 is again the maximizer, 

* / \ * 

set v : = v; 6 : = 6 and go to 14. 
if I v-w^l/l v | < ( rel , 

set v : = v; 8 : = 6 and go to 14. 


b) 
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Step 6 : 
Step 7 : 

Step 8 : 


Step 9 : 


Step 10: 
Step 11: 


Step 12: 


Step 13: 
Step 14: 


_ <*■ * 

Set v : = v (lower limit of v ). 

Bisection step: Vg := (v + + v“)/2 


Termination criterion: 



Yes: compute Uv, to obtain 6 , 

* * 

set v : = Uv; 6 : = 6 (maximizer of Uv) 
and go to 14; 

No : go to 9. 

Test, which situation arises: 

Compute Tv (corresponds to Uv with possible computational reduction) 

a) if Tv > v, set v~ : = v and go to 7; 

b) if Tv < v, set v + : = v and go to 7; 

* 

c) if Tv = v, set v : = Tv and go to 14; 

d) otherwise, go to 10. 

Situation 3 arises: 


Compute v : = Uv. Let the maximizer be 5. 
Termination criterion: 


If I v-v|/|v| 


< f rel’ set v * := v; S * := 6 


and go to 14. 

Test, whether monotony arises: 

/V A * 

a) if v > v, set v : = v; 6 : = 6 
and go to 7; 

b) if v < v, set v + : = v; 6 : = S 
and go to 7; 

c) otherwise, go to 13. 

A 


Set v : = v and go to 10. 
Stop. 


Explanation of iteration Tv in Step 9: To test the three situations mentioned above, 

one can perform a full maximization step Uv and compare v with Uv. In order to 

* * * ^ 

satisfy any one of the conditions: (a) v <v; (b) v >v; or (c) v > v, a full 

♦ 

maximization step is unnecessary in all cases. For example, to test for v > v, it is 
sufficient to determine a decision rule 6 which results into an improvement L(<5,v) > v. 
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(Note: This is a maximization problem, hence " >" is used). One need not look for the 
best rule. Similarly, to determine the occurrence of Situation 3, it is sufficient to 
determine two components i, j such that Vj < (Uv). and Vj > (Uv)j. The other 

components need no longer be tested anymore. One can therefore define Tv in the 
following manner. 


Iteration Tv: 


First Component: 

For the valid decisions in state 1, compute the size d(d,l,v). As soon as l(d,l,v) > v(l) 
for a given d compute for the other components. 

Remaining Components: 

a) Let a d be found for which l(d,l,v) > v(l). One can stop computing for the 
other components i as soon as one determines a d such that l(d,l,v) > v(i). If 
no such decision exists in a state j > 1, i.e., max j(d,j,v) < v(j), then it follows 

immediately that Uv ^ v (not comparable) and one can stop the test. 

b) Let no d be found for which l(d,l,v) > v(l). Perform the full maximization step 
in the remaining components. However, if a d is found in a state i > 1 such 
that l(d,l,v) > v(i), then it follows immediately that Uv % v and one can stop 
the test. 

The above bisection method in combination with dynamic programming is a basic 
algorithm which allows numerous variations and fine-tuning. Numerical experience 
shows that this method performs a lot better than value and decision iteration. 

If a very special problem arises, methods tailored to such situations can be very 
effective. One example is described in the next section. 
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§51 COMPUTATION OF OPTIMAL (s, S)-POLICIES ACCORDING TO 
FEDERGRUEN/Z3PKIN 


The method of FEDERGRUEN/ZIPKIN for the computation of optimal (s, S) policies 
is related to the undiscounted standard AHM model with backorders. The distribution 
of U is in the form p^, p^, p^ p^,... We consider the model with proportional ordering 

cost. Let 


f(y) : Expected value of inventory holding and shortage costs of a period with 

starting inventory y 

y : Initial inventory (before a possible order) 

x : Initial inventory after an order 

x-y : Order quantity 

6 : Ordering rule of the (s,S)-Type 



if s + 1 < y < S ; 
if y < s; 


S 



F(x,y) : 



Optimal ordering rule 

Average cost for an ordering rule 6 

Minimum average cost 

Stationary state probability of inventory y with policy S 

Single period cost 
Fixed ordering cost 


F(x,y) = 


f (y )* tf x 

[ k + f(y), if x 


= y; 
> y; 


F/y) = F(%),y). 
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The Principle of Optimality is expressed using the following notation 


00 

v(y) + c* = F^(y) + J v[(5(y)-u]p u , (51.1) 

u=o 

for all y < S. By definition 

v(S) =0 (51.2) 

for normalization. If one wants to solve the functional equations (51.1) we must 
restrict the state space to a finite size, i.e., allow a smallest y = y^ such that 

^min- y- ^min serves 218 an absorbing barrier. Equation (51.1) is changed 

accordingly. The summation should be performed until 6{ y) — u = y^. . This 

restriction of the state space causes an inaccuracy in the model. 

The method of FEDERGRUEN/ZIPKIN avoids this. It is not based on the recursive 
evaluation of the functional equations but monitors the stock movement after 
inventory is filled up to S. 

We define 

t(w) : Expected time until the next order, if the current stock lies w units 

above the ordering point, w = y — S, w > 0. 
i/ s (y) : Expected costs until the next order, if the current stock is y, y > s. 

Both functions t and v satisfy the equations 


w-1 

t(w)=l+ ^p u t(w-u), w>0, (51.3) 

u=o 

y-s-1 

A 

2 / s (y) = f(y)+ \ p/ s (y-u), 

u=o 


y > s. 


(51.4) 
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t is independent of the (s, S) Policy and v depends only on s. The system of equations 
(51.3) has a triangular form: 

t(i) -i 

t(2) -1 
t(w) -1 

Similarly, the system of equations (51.4) has the same form. 

Starting with w = 1, t can be computed very rapidly. The same is true for v , starting 

s 

with y = s + 1. The essential advantage of the method lies in the fact that one can 
compute the values below with t and v 

_ ^ s (S) + k 
c <r t(S - s) 

^(y) + k- C(5 t(y-s), for y > 0 
k , for y < s 

Equation (51.5) is exactly the cycle cost v (S) + k per cycle. The validity of (51.5), 

(51.6) shows that these expressions satisfy the Principle of Optimality when 
substituted in (51.1), (51.2). Thus a fast method of policy iteration can be constructed 
with equations (51.3) to (51.6). 

Step 1: Initialization. 

Set limits s, S, S for the values s, S. 

/\ A 

s : smallest whole number such that f(s) < f(S) + k; 

A 

S : smallest whole number which minimizes f(y); 

A A 

5 : smallest whole number such that f(s) > f(S) + k; (compare § 42) 

Ut s old : = S old : = -1 

Choose a starting policy 6 = (s,S) and let s : = s; S : = S. 

new new 

Compute the function t(w), w = 1,2 ..., 5 — s from (51.3). 



(51.5) 

(51.6) 


= Po^ 1 )- w-1 

= P 0 t(2) +pjt(l) 

= P 0 t( w ) + • • • + Pw-^C 1 ) • 
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Step 2: Computation of the Value function. 

If s has changed from the last iteration (s j d i s new ); 

Compute v Ay), y = s + 1,... U from equation (51.4). 

Compute Cc and Vr(y), y = s, ..., 5 from equation (51.5), (51.6). 


Step 3: 


Policy Improvement. 

a) Save the old policy: s Qld : = s Qew ; S Qld : = S flew 

b) Compute the minimizing S’; S < S’ < S 



min 

S<y<5 



new 



c) Search for a better s: 

cl) in increasing order: s + 1, s+2,...,s; 

in case an order is worthwhile at state s + 1, i.e., if 

k + (S’) < v^s + 1), 

search further in increasing order until a y is found for which it is 
no longer profitable to order. Let this case be at inventory level 
rj. It must therefore follow that 

k + v^S’) < v^y) for all y, s < y < r) - 1. 


&t s new : = ^ - L 
Go to Step 4. 
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c2) in decreasing order: s-1, s-2, s; 

in case an order is not worthwhile at state s—1, i.e., if 

f(s - 1) < c St 

search further in decreasing order until a y is found for which it is 
profitable to order. Let this case be at inventory level £. It must 
therefore follow that 

f(y) < Cr for all y, £ + 1 < y < s. 

Go to Step 4. 

Step 4: Termination Criterion. 

^ ^old - ^ s old ’ ^old^ ^ ^new - ^ s new ’ ^new^’ ^ et ^new ' ~ ^old ant ^ 
to step 1. If ^old " 5 new then go to 5. <5 Qew is the optimal policy. 

Step 5. Stop. 

All methods discussed in this chapter (exluding value iteration) for inventory problems 
of realistic sizes can be computed within seconds on a personal computer. 


CLOSING REMARKS 

Inventory theory is by no means exhausted. A book can also never hope for 
completeness. It only presents a selection and thus is quite subjective. Our objectivce 
was to present the most important and typical methods and models in order to 
stimulate the reader to reflect on his own. 
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